The Data Stack Show - 28: Next Gen Data Governance with Stefania from Avo

Episode Date: March 10, 2021

On this week’s episode of The Data Stack Show, Eric and Kostas are joined by Stefanía Bjarney Ólafsdóttir, the CEO and co-founder of Avo. Avo, which started in 2018, provides data analytics gover...nance as a service, helping organizations make data-driven decisions to improve their customer experience.Highlights from this week’s episode include:Stefania's background with mathematics, philosophy, bioinformatics and consumer mobile (2:39)Making pioneering decisions as head of data science at QuizUp (8:34)Is less more? Choosing fundamental parts of the customer experience and understanding them very well (16:56)Bringing data consumers closer to data producers (18:34)Avo mission to provide analytics governance as a service (25:09)Avo use cases (36:37)Focusing on event-based data (44:29)The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome to the Data Stack Show, where we talk with data engineers, data teams, data scientists, and the teams and people consuming data products. I'm Eric Dodds. And I'm Kostas Pardalis. Join us each week as we explore the world of data and meet the people shaping it. Okay, Steph from Avvo is our guest today. And the question that I want to ask her is, which this won't surprise our listeners who've been around for a while, but it's about her background. She studied mathematics, did work in bioinformatics, and then went from bioinformatics into consumer mobile. And whenever I see sort of a progression like that, where there's a significant
Starting point is 00:00:55 sort of change in, you know, the industry or business that you work in, it's just always fascinating to me how that happens. So that's what I'm, that's my So that's my main question I want among many others, but that's the main one. How about you, Kostas? I'm really interested to learn from her about data quality. She has an amazing background. And actually, she started like super early trying to solve these problems.
Starting point is 00:01:19 And this journey also ended up by building like AVO. So I really want to learn more about how she perceives data quality, why it's important, how it can be solved. And of course, learn more about how AVO addresses these issues and delivers value to their customers. So that's from my side. I think we are going to enjoy this conversation with Steph a lot. I agree. Well, let's go chat.
Starting point is 00:01:47 All right. We have a guest I'm really excited about. Steph from Avvo. Thank you for joining the show, Steph. Thank you for having me. Super excited to be here. Yes, yes. Well, so many things to talk about.
Starting point is 00:02:03 But as our listeners know, I really love hearing about people's backgrounds and stories and what they did before they ended up doing what they're doing today. And your background is fascinating. So you studied mathematics. I'd love to hear about, you know, hear about that journey. And did bioinformatics and then you jumped into consumer mobile. And so I'm just fascinated to hear how did you go mathematics to bioinformatics makes more sense, but bioinformatics to consumer mobile seems like a crazy jump. And I just would love to hear that story. Yeah. Well, thanks for highlighting it. I guess maybe one interesting aspect that might explain it a little bit. I didn't only study mathematics, I actually did a double and I studied philosophy as well, which makes me extremely curious about people and helping other people answer questions and also just ask good questions so you know that might explain a little bit about or yeah like how I moved into also being curious about just user experiences and people how they
Starting point is 00:03:13 experience games and things like that but well another aspect as well that I just highlighted recently in another interview I moved around a lot as a kid. I had very young parents and I actually had moved around 30 times before I finished high school. Wow. And that also I think makes me extremely curious about people. So I love asking questions. I'm extremely curious. I think people describe me that way. One question I just have to ask. So mathematics and philosophy, you would generally think about those as completely separate. But since you were studying both at the same time, were there any commonalities that stuck out to you in the two different disciplines? Yeah, I love that you brought that up because yes, absolutely. They are surprisingly similar. I feel like the world is sort of split into two groups,
Starting point is 00:04:15 the people who think it's obvious that you would study those two together. And then the people who think it's like, well, that's really weird then and then when you take a little bit into it both of them are very i guess logic oriented mathematics is rigorously logic oriented and has language to describe everything while and trains you up and really you know being logical and deducing logic and and you know finding the truth and philosophy does the same except it doesn't does it via you know written language and you are trained into taking texts of very complex words and vague statements and translating them into what are the things that are the actual assumptions behind this and is it true and what
Starting point is 00:05:06 are the first principles behind it and so I thoroughly enjoyed both but obviously also philosophy stretches a little bit into you know ethics and and sort of thinking a little bit more about people and behavior and society and how we should build our society. And so I really enjoyed that part of philosophy as well. Yeah, it is. It is interesting now that you say it that way and we can move on because I don't want to spend the entire time on this because I know we, I know I could too, but we have to talk about data because we named the show,
Starting point is 00:05:42 the data stack show. It's a, it's a legal requirement. But it is interesting, the concept of sort of having tools and mechanisms that you leverage in order to understand something or find a solution to something. And that's the same with philosophy and mathematics. Absolutely. Absolutely. It's just extremely oriented towards sitting for long times and thinking about very short statements and digging into them and like figuring out what the hell is true behind them. So yeah, I love that. Well, tell us about what did you do in consumer mobile? So you took that background into consumer mobile and what specifically how are you applying your skill sets there? Yeah, exactly. I'll go through them, jump through from, so I went from university to, well, first I traveled around Australia for half a year, which was amazing. And then I came back home and to Iceland, I'm from Iceland and applied for a couple of the sort of industry, which would
Starting point is 00:06:43 probably be called startups, but to me were like just huge corporate organizations. And one of them was a DNA analysis company, I guess, called Decode Genetics, where I had the opportunity to learn from amazing statisticians, world-class statisticians that have released fundamental discoveries
Starting point is 00:07:04 and statistics and imputation regarding how you calculate people's DNA based on sample data or based on their familial information and things like that. So in that role, I did a lot of distributed computing, setting up all sorts of calculations and merging data sets again. And, but I also worked with the people over there that were like doctors and biologists, and they were trying to, you know, their, their, their perspective was, you know, I'm, I'm, I'm curious about the mechanics of a physical trait of a human. And then it was my job to bring in the information about like, how could maybe the DNA statistics
Starting point is 00:07:45 and the data about their DNA have actually impacted that physical trait. So I was correlating physical traits with DNA mutations. And that was really fun. Yeah, I love that. And so I was one of the few people in Iceland at that point in time that had really worked with huge data sets for reals and not just, you know, what you study in school or like, you know, I don't know, because you can train yourself in, you know, writing software and working with big data sets, but nothing really compares to actually,
Starting point is 00:08:19 you know, having to go through the pain of fixing it in reality. Oh, sure. There's such a big difference between stubbed out data for, you know, sort of learning and exercises and like real world data. That's totally different. It's totally, yeah, absolutely. And that jump was really educational for me. And I had sort of started studying then bioinformatics after this job and was actually had gotten offered a position doing PhD also in statistics slash bioinformatics in Oxford and also in Berkeley, which was really exciting. I was excited to move to California as well. And then this mobile game called QuizUp blew up all of a sudden and reached 1 million users in its first week which
Starting point is 00:09:06 was the fastest growing app in the app store at the time and the founders were icelandic and i had some friends that were working in the company and i was like i can do a phd whenever but when will i have the opportunity to work with these really super fun people and building up a data team and like learning so many things about these people that are playing the game and I can build like a recommendation algorithm for what quizzes you should play if you like Harry Potter and things like that so I got super excited about being joining that team as a founding analyst that was sort of the shift basically so I was just like the Icelandic community type of thing it's a small society and just for example our the the coach of the national soccer team in in soccer he is also a dentist
Starting point is 00:09:53 yeah everyone does everything in Iceland so that's I love that. So that was the switch. And that was sort of the backstory behind me joining this company, QuizUp. And this was 2013. And so that was super early product analytics. I don't think the term even existed at that point. And just a couple of developers had thrown together analytics events using Mixpanel, if I remember correctly. Yeah, they were pretty early. Yeah, this was super early Mixpanel.
Starting point is 00:10:30 This was like Mixpanel was still, you know, we had a bunch of people that were just placing like Powered by Mixpanel on their website to get some free events. I don't think Amplitude even existed at this point in time. Segment maybe just launched their famous Hack News post where they released a wrapper around all of your analytics. Yeah, that was like the genesis that time period. Exactly. So when you were Googling stuff, I was just like, what is retention, you say?
Starting point is 00:11:04 What does that mean? And you couldn't really find definitions of it. And so when the board was asking our CFO, who I reported to, about retention after every launch that we made and for every board meeting, I was like, I don't know what they really want. You have millions of definitions of retention that we could work off. And, you know, I don't even know what the benchmark is. I don't even know what they would consider to be good, what they would consider to be bad. So I just spent my time at that point digging into data. But, you know, most of the data was terribly broken and inconsistent.
Starting point is 00:11:45 These few events that they threw together into Mixpanel. So I sort of spent my time just hacking together scripts and you know, you know, making replicas of the operational databases, which were like 16 shards of the user base and Postgres hosted on AWS or something. And it took maybe probably typically two weeks to answer some really basic question. Obviously that was not acceptable. So we ended up fixing that problem by basically throwing people at the problem.
Starting point is 00:12:19 I started scaling up the data team, hiring more people. So more people could be answering questions like these. But eventually, you know, or, and while we were doing that, we were always, for every single feature release, we were begging the mobile developers to add, you know, specific events for some things and questions that, you know, the board always wanted to answer or the CFO or the CEO always wanted to answer.
Starting point is 00:12:44 And it was so interesting the dynamics because we were in the position to have to report on these things, but we had absolutely no control over what data existed. Absolutely no control over that. But in some way, it was still our fault if we couldn't answer the questions, which is a bit of a tricky situation to be in. So that was sort of the birth of my sort of, you know, how I evolved this role, which was really amazing and a great opportunity to be head of data science for this company because it was, I guess, a twofold role. I got the opportunity to really build the data culture,
Starting point is 00:13:22 which is sort of, you know, getting people curious to use data to answer questions and go beyond their gut feeling. I say beyond because I know that gut feeling is extremely important in making product decisions and sort of you have to have some insights and sort of an understanding of the customers and the users. But, you know, it's really important to be able to back it up with data. And it also just makes the buy-in from anyone on the company just stronger
Starting point is 00:13:50 if you're able to do that. So just getting people curious to do that and basically doing some internal marketing on like, you know, hey, here are some cool things we know now because of data. What could you know? And, you know, having regular meetings
Starting point is 00:14:04 with everyone on the, you know, in all of the different departments in the company, trying to figure out what their needs were, trying to just like lobby, lobby for people using data. And that was really fun. So building up the data culture. So that was one aspect, data culture.
Starting point is 00:14:20 And then the other aspect is just everything around the technical infrastructure of data. So making sure we have the right tracking in place, making sure we had good quality of the data that we did have, making sure we had the infrastructure in place to both do self-serve analytics via tools like Amplitude and Mixpanel, but also extensive raw data analysis and building the infrastructure so that the data scientists were able to, you know, we built our own version control for Jupyter notebooks because no tools existed at that point to do that.
Starting point is 00:14:59 But everything is already built around that today. So nobody has to do that again. We scraped at SDK webpages to get some data from there because they didn't have their own APIs. And we built out like Redshift clusters and pipelines to pipe the raw data from all of our data sources into Redshift, which could have been done with Segment today, obviously, or on Particle or any of the CDPs, Rudderstack, all of the amazing tools that now exist.
Starting point is 00:15:32 But none of that existed then. But yeah, so it was hugely, it was a wonderful learning experience. You were pioneering so many things. It was fun, certainly. I learned a million things. Well, I'm interested. So one more question.
Starting point is 00:15:49 I have been very selfish with the time here. And I know Costas has so many questions to ask. And we want to hear about Avvo. But one more question. And I'll ask you to do the impossible task of distilling this down into just a couple quick lessons. But hearing about your experience pioneering tooling that almost a decade ago didn't exist and now operating in a world where product analytics is table stakes for any software
Starting point is 00:16:24 company or sort of high-performance software startup. product analytics is table stakes for any software company, you know, or sort of high performance software startup. And there's so much infrastructure. I mean, that's a, that has been a tectonic shift to not only sort of live through, but also have actively taken part in. So I'm just interested to know what are some of the, you know, the top maybe two things that stick out to you having seen and really been part of sort of pioneering that entire shift in the data space?
Starting point is 00:16:56 Oh, that's a really good question. So obviously I have made very strong opinions here, I guess loosely held because I love being questioned on them and having the opportunity to discuss them because I know that there are just so many people that have amazing experiences. And just like if anyone is listening that has anything amazing to add to what I'm about to say, I would really love to hear that. So reach
Starting point is 00:17:26 out to me at via Twitter or like my email or whatever. But I think there are probably two things that probably two things that are counterintuitive to anyone who is working in product analytics and this sort of fresh in product analytics and was very counterintuitive to me also when I started. And that is this on one hand, I would say that less is more is that that is sort of like a counterintuitive thing. And that's also a very debated point of view, I would say, in data. But from my perspective, it's more important to choose fundamental parts of the customer experience and understand those points very well, rather than having vague understandings of, I guess, a wider set of data. This is one of the things that I think matters a lot. And there are multiple reasons behind it. And I can definitely go deeper into them. And I would love to actually,
Starting point is 00:18:32 but less is more. It's a counterintuitive one. And I think then the other one that I really fight for and I'm very passionate about myself is, and I've had the opportunity to see through in QuizUp and then as a consultant after QuizUp, working with a few companies around the world and like just advising them what they should do next. And then obviously with our customers at Avvo. And that is, I have a deep passion for bringing data producers closer to data consumers. And when I say data producers, in this context, of course, I'm mostly thinking about customer experiences. And so that is often coined as product analytics. And so the data producers of those are the developers, the
Starting point is 00:19:20 product developers that write the code within the applications, the web apps, the mobile apps, or sometimes the backend that actually get events into other systems, into Roger stack, segment, mixed panel, amplitude, heap, Pendo, Firebase analytics, your BigQuery instance, wherever you want to put your user experience event data, the producers are the developers that write the code in the product that triggers an event when the user does something. So those are the data producers.
Starting point is 00:19:55 And the data consumers are the data analysts and the product managers that then use tools like Amplitude and Mixpanel or Firebase Analytics or Looker or Tableau or whatever to look at that data. And what I saw happening in QuizUp, for example, is the data producers and data consumers were entirely separated. And I was just sharing this in the story just before, which is I reported to the CFO and the CEO and the board of directors, and I made reports for them. And their expectations was that I would be able to answer behavioral questions. But the product team had no obligation to sort of let me know when they were about to release something or, you know, ask me for input or feedback
Starting point is 00:20:46 on how they should instrument some data or make sure the data existed. And that meant that when I was asked for information, I had just no way of answering those questions. And obviously, that was, I guess, a major driving force behind this in sort of bringing data producers closer to data consumers. And was the driving factor behind working closely with the CTO of QuizUp and building tools and processes that added way closer collaboration with the product managers, the data scientists, and the product developers for every single feature release in making sure we were really asking ourselves what was the goal of the feature before we even started implementing it so that we would also be able to design what are the metrics that we will use to measure the success of that feature so that we will then
Starting point is 00:21:45 in the third section be able to actually make sure we have the data to measure those metrics and my sort of so you know when we started that process the we had maybe one or two developers in the company that opened charts at some points in their sort of software development life cycle opened up you know and asked some questions but after we sort of made this a fundamental part of the software development life cycle or the product development life cycle i would say like 70 of the developers of the company they started you know they started asking the questions and they would be posting on slack like whoa check this out we just saw like because they would be posting on Slack, like, whoa, check this out. We just saw,
Starting point is 00:22:26 because they would know when the product was released and they would be just waiting, waiting for amplitude data to trickle in or waiting for mixed model data to trickle in. And so they would just go and look up their charts because they knew the definition of the metric because we had defined it together. So they would just be able to point and click a few things
Starting point is 00:22:43 and then open up the chart and share it with a team and ask some follow-up questions. And this again, like this change bringing producers closer to consumers that really impacted the quality because all of a sudden the producers were consumers and they've fundamentally understood how painful it was themselves when the data didn't work. Yeah. And it actually ties a little bit into the other sort of the other less is more angle as well, which is, you know, if you think about data and from this perspective that you plan it as a part of your product release cycle, you realize that it's never a good strategy to decide to track
Starting point is 00:23:23 everything type of thing because you know just like when you're releasing products you just have to say no to things it's really important to focus because you won't be able to do everything well if you try to do everything you will be able to do a lot of things really badly and it's way better to measure a single thing really well and then have the opportunity to ask some follow-up questions rather than having you know bad answers to a bunch of questions love it i could talk about both of those points all day but i have to hand it over to costas because i've been monopolizing the conversation thank you yeah. Take it away. It's okay. I mean, you had an
Starting point is 00:24:07 amazing conversation so far. So I really enjoyed listening to what you were saying, both of you. Maybe we should arrange another recording where each one of us can impersonate the philosopher or something and have a discussion this way. Yeah, yeah, let's do that. I mean, let's make a recording with Geddel, Russell, Wittgenstein, like this kind. Yeah, yeah, let's do that. I mean, let's make a recording with Gerdel, Russell, Wittgenstein, like this kind of question. I think it would be great. I do have to say, hearing a mathematician and philosopher say that your gut instinct is really important, I thought that was really cool. All right. I'm officially done now. I love that. Yeah. Thank you, Eric. All right. I'm going to ask, I'd like actually to start, Steph, by giving us a bit of more
Starting point is 00:24:52 background around the company itself, AVO. I mean, we talked a lot about what happened early in your career and what made you to do the things you are doing today, but it would be great to learn a little bit more about the company itself. So can you share some more information about the company? Happy to, yes. Well, I've given sort of a backstory, but basically, you know, the tooling that we built at QuizUp, the tooling that me as head of data science
Starting point is 00:25:20 and the CTO who was at that point, sort of his background was in mobile development. His complaints were from the developers who hated how frustrating it was to, and time-consuming and like inefficient it was to implement analytics events and particularly how often it failed and how often I would have to go back to them and be like, excuse me, I'm sorry, you did this all wrong. And now I can't answer my questions. And that was just so frustrating. And it was frustrating for everyone, obviously, basically, nobody, nobody likes, I mean, they don't want to spend a lot of time on this,
Starting point is 00:25:55 but they also don't want to disappoint. Like nobody wants to do that. Nobody wants to deliver something that, you know, is then in the end just useless. And obviously the board and the CEO and the CFO, they wanted us to be able to answer questions. So we, you know, being two very passionate people, me and the CTO, we worked really closely together in building tools that automated a bunch of this stuff. We built a central source of documentation for the schemas, basically, that had event descriptions and the types of the properties and all that stuff.
Starting point is 00:26:31 And then we built code generators so that it would just be super easy for Android and iOS developers to just implement analytics events without having to think too much about it. All of the processes that we also built around this design part, the data design part, we called it purpose meetings at QuizUp, was this thing about setting goals and then deciding the metrics and then designing the events based on that. This originally was so that data scientists would be involved in this process from start to finish.
Starting point is 00:27:06 And we would propose the event structures based on the needs of the data consumers. In the end, though, as the data producers became closer to the data consumers, we just saw this really clear shift where, you know, it was just about aligning on goals and maybe deciding together on the metrics. But in the end, everyone just was self-serve designing the data. So the developers were designing the event structures based on the code base and the data, like how the data sort of best practices that we had in place, a quiz of the naming conventions that we had in place at QuizUp, the naming conventions that we had, all those things. So eventually, sort of what I would like to say is, you know, we transitioned from a centralized analyst, you know, I was the person that answered the questions,
Starting point is 00:27:57 to an attempt to have self-serve analytics, where, you know, we wanted everyone to answer questions, but that was super throttled by lack of data literacy and lack of data quality. And so as we built out these processes of ensuring data literacy even before the feature was out, and then the tools to ensure the data quality before the feature was out, we basically also built like, we built, we empowered our self-serve analytics by building the tools and processes for self-serve analytics governance. And that is something that then, you know, I started taking for granted. But fast forward a couple of years later, I, QuizUp had been acquired and I had started a company with a couple of friends. And it was only about five months into
Starting point is 00:28:46 that company where we actually shipped a product update that was based on incorrect data and it was just like a punch in my gut and I was like oh my god I'm back to back to like five years ago or whatever and I will never be able to work in digital products because it's always gonna suck and I had this realization moment where, oh my God, you know, all the tools that we built at QuizUp, what are we going to do without them? And, you know, we're trying to take this product to market and it doesn't make sense for us to spend time on building those tools again. I can't believe it. And that's what I started sort of syncing with some of my colleagues from the industry,
Starting point is 00:29:22 people from Spotify and Twitch and Airbnb. Turn turned out that we were super early at QuizUp in building these tools. We were building them in 2014, while like Spotify built a compatible thing in 2016. And I was like, what? I can't believe that. That's super cool. And that was sort of a trigger for us to really build Avvo and start sort of bringing these solutions to the general business, the general digital product. And so that's the backstory. But what Avvo does today is Avvo is analytics governance as a service. We are here to pay down the analytics debt of Silicon Valley, which is like, there's a lot of debt going on. And we're here to really optimize this release process for the analytics workflow. And we're bringing data producers and data consumers together with our product managers and developers and data scientists collaborate to plan and to validate and to implement their analytics for every single feature release.
Starting point is 00:30:33 And it fits entirely with the sort of the Git product release workflow. the pioneers for building a branch workflow for your tracking plan, which typically a lot of schema management tools have had versions, semantic versionings for your tracking plans or for your schemas. But that's something that's scary if you're not a data engineer, basically. And one of our earliest learnings was we really need to support the data consumers well, and they're not necessarily data engineers. They can be product managers that effectively only understand what their goal is with the feature, but don't have a deep understanding in event structures and good event structures or semantic versioning. And so this is our mission, analytics governance as a service, bringing self-serve analytics governance to empower really that shift to self-serve analytics,
Starting point is 00:31:34 which is a fundamental piece for companies to succeed today, I would say. Absolutely, I totally agree with that. Can you also tell us a little bit about, I mean, when the company was started, when you launched the first version of the product, like your experience with the early traction, your experience with fundraising in this space? I think that would be amazing for people to hear about. Yes, absolutely.
Starting point is 00:32:02 I'm happy to share a little bit about that. We are very customer-centric as a group of people. So we care deeply about our customers. Like I was talking about earlier, we have strong opinions in what we're doing. However, it's really important for us to confirm that by talking to our customers and looking at the data for how they succeed. And that's how we have built the product from the beginning. We have had customers from effectively day one when we started building our prototypes. We also got into YC. So we are a Y Combinator company. And that was because Gustav Ahlströmer, who used to, or was, I think, one of the early growth persons, I guess, in Silicon Valley.
Starting point is 00:32:49 He was a growth lead or a growth product manager or product manager for growth. There are all of these variations of the growth titles at Airbnb. So he, you know, did like was a fundamental piece in the growth strategy for Airbnb. And he experienced this problem immensely. So when we did our Y Combinator interview, he was like, I've had that problem. And so we did Y Combinator, which was, of course, an amazing adventure. And following that, we raised funding around in Silicon Valley. Before that, we had also taken in some funding from
Starting point is 00:33:25 Icelandic venture capitalists, but also angels that sort of supported us also because we're, you know, it was a team that had a lot of, I guess, I'm lucky to be have been working with a great team in QuizUp, obviously. And so that was a reputation that we had built also in, in, in Iceland. And so even though this was a very pioneering idea at the time, and nobody was talking about this, when we started doing this,
Starting point is 00:33:56 nobody was talking about it. And so, but they sort of, you know, they, they, they funded the team, you know,
Starting point is 00:34:03 they, they, they believe that we were a team that would be able to ship stuff and build stuff. And so that's exciting because it turns out they were right. Yay. And then after Y Combinator, we raised a round also from a company or a fund called Heavybit. They are an accelerator and a fund for developer tools. They were founded by James Lindenbaum,
Starting point is 00:34:30 among other fantastic people, who was the founder of Heroku. And James's perspective on developer tooling has really helped a lot of fantastic developer tools go to market. For example, CircleCI, Stripe, just endless, go check them out, heavybit.com. Fantastic group of people that have been really helpful. And then GGB Capital, fantastic partners there as well.
Starting point is 00:34:52 They've funded giants as well, such as Slack. And our partner there, Glenn Solomon and Oren Younger, they've just been immensely helpful as well, and the entire GGB Capital team as well. And I think they're also very passionate about both developer tooling and the data space. And so that was sort of their drive. They also work pretty closely with the Heavybit team often. So I guess that was sort of a hand-in-hand type of thing.
Starting point is 00:35:20 And we're lucky to be working with these world-class backers of developer tooling and in the data space that have huge experiences in both developer adoption and enterprise sales. So I have them in my inbox all the time and I'm able to ask them for advice. So I'm really fortunate and lucky about that. Yeah, that's amazing. And I understand how you feel about feeling like fortunate, but I'm pretty sure that all these people got involved also because of the team and the passion that you have about solving the problem. So the reason that they are around is also because of you. I mean, you personally and the rest of the team. So congrats for that. That says a lot about the quality of the team, the company and the product.
Starting point is 00:36:07 Thank you for that. Those are kind words. Thank you. All right. So moving forward, let's discuss a little bit more about the product. So our audience can learn what AVO is about and how it works. And let's start by what's the value that someone gets from Avvo today? I know you have touched this previously in the questions that both Eric and me asked.
Starting point is 00:36:30 But yeah, let's try to make it a little bit more concrete about the value proposition that Avvo has as a product. Yeah, thank you. So ultimately, we see a lot of our customers talk about how much developer time they save. Obviously, they save a lot of data quality or like just they rescue their data quality and they start paying down their analytics steps, I would say. And that's the path for, for example, companies like Rappi, which is a huge Latin American company that is like a DoorDash and a Lyft slash Uber slash Instacart. All of those combined, basically just delivering whatever you need as a service. And obviously that exploded in sort of COVID and March, April.
Starting point is 00:37:20 And that is effectively sort of around the time when they reached out to us. And they had been trying to audit their data for months. They had been working on that. The BI team had been trying to sort of catch up on even just understanding what was currently wrong with their analytics before they wanted then even also to try to fix it. At the same time, the developers were just tearing their hair out because of the inefficiencies of implementing these analytics. So their journey was they installed Inspector,
Starting point is 00:37:53 which is our sort of data observability product. You install the Inspector SDK to inspect and observe your current state of tracking ongoing. And that's how you get sort of your, you get a snapshot of your analytics depth there and are able to fully prioritize what is wrong with it today
Starting point is 00:38:13 so that you can, so you can prioritize what to fix, like prioritize the top things that you really need to work, like three, four, five events that you really want to make sure are okay. And then you can just like gradually step by step pay down your analytics adapt. So that that's their first step. And then they team by team adopt the AVO workflow, as we call it. So the AVO workflow involves all
Starting point is 00:38:40 of these stakeholders that I've talked about. The product manager comes into Avvo, opens up a branch to plan the analytics alongside a feature release. The data scientist and the engineer, they sort of give it a review, request some changes potentially. The engineer from the perspective of like, yes, I can implement this analytics event in this format. And the data scientist brings in sort of like the bird's eye overview
Starting point is 00:39:09 of the company analytics and is like, yeah, these analytics structures make sense in comparison with the rest of the company. Great, let's move on. And once that's been approved, that's the third step where the developer pulls generated code. We generate type safe analytics functions. We call them functions where you can implement using, you know, avo dot, you know, item added to cart or whatever, instead of calling a generic analytics SDK, where you have to pass in a string for the
Starting point is 00:39:40 name of the event, and then a huge object blob with like a name of properties and a bunch of property values. Instead of doing all of that, you just call a type safe our function that you generated using your avo CLI, based on the tracking plan, which is all defined in a user friendly UI by the PM and the data person. And that's the third step. And then the fourth step is, you know, the developer, you know, treats the build just like they would for any other build. They just go through their product before they release the build, before they sort of release the feature update. And then they use the in-app Avvo debuggers, where you have like a bubble that shows you
Starting point is 00:40:19 when you're triggering the R analytics event. So you can confirm the timing is consistent across all of the platforms. You can confirm the data structures are exactly as you intended for them to be. The PMs and the analysts also love the NF debuggers and they use them even, you know, outside of the regular feature release process because they can really get a full understanding of like what, what really is being tracked in the product. And so after that
Starting point is 00:40:45 sort of is is sort of quickly checked off the branch basically gets merged the ava branch that happens just at the same time as you merge your git branch and your tracking plan update get out gets automatically published into whichever other source you need schemas in. So that could be your downstream, you know, your BigQuery table somewhere or your Redshift table or your protofile or your amplitude govern or your segment protocols or your mixed panel lexicon, just wherever you might need to manage your schema, you can then automatically publish that after that branch has been sort of finally reviewed and implemented and just ready for production. And that's when, yeah, you know, then just everything is released. And like this, this entire cycle, the workflow,
Starting point is 00:41:38 it's shortening the time that people spend on this by, you know, incredible amounts. So for example, Patreon, they reported, basically they are getting 90% of their time back that they used to spend on this for every single feature release. They used to spend one to four days on this back and forth and like a QA process and syncing between iOS and web or whatever and data. And now they consistently use maybe like 30 to 60 minutes on it. And so that's obviously a huge time saving. Yeah, that's amazing.
Starting point is 00:42:12 Actually, something that I want to add in what you said, Steph, is that my feeling based on my experience so far is that the most successful B2B products, they deliver innovation in two directions at the same time. One is, of course, like the technology and the product itself and how it solves technical problem, for example, in this case. But the other thing which is equally important, and not everyone is doing it, is that it's organizational innovation, as I would say, which means that it helps the company to reorganize
Starting point is 00:42:46 around more efficient processes or automating processes. And I think that when you combine these two together, you have at the end like an amazing both value proposition for the company and the product, but also you have a building an amazing mode for your company. I'd love to chat more about this. I mean, it's something that I'm a bit obsessed to be honest, and something that I'm always trying to figure out. And there are some companies that they do that, like they did that, like Slack did that.
Starting point is 00:43:18 Looker did that. That's exactly what the LookML plus the visualization layer was doing and creating like a very clear and efficient distinction between the people who do the modeling of the data and the people who are consuming the data, like exactly what you're saying about the data producers and consumers. But anyway, that's a very big topic that we can keep discussing. We'll bring it up in our Wittgenstein. Absolutely.
Starting point is 00:43:42 Absolutely. We can do that. So, all right, moving forward, let's talk a little bit about the focus around the data that you have. I mean, like a company generates a lot of different types of data. I mean, from logs that might be consumed, for security reasons, to clickstream data that are actually described like they are like breadcrumbs let's say of the customer behavior it looks like you are focusing more on the second case like you you are mainly working with product analytics and the clickstream data that customers are generating do you want to expand a little bit more on that like why you think that's more important
Starting point is 00:44:21 if it is more important and what are your plans general? Like as you move forward with the product, are you planning, for example, like to support something else? Yeah, these are really good questions. So I think the first part here is the why. Like why are we focusing on customer data? Why are we focusing on product analytics? And from my perspective,
Starting point is 00:44:40 I guess like obviously what impacted is I joined as an analyst applying for a role to report to the CFO. The CFO obviously thinks about finance and cares about finance. And that's really important to them. But at the same time, this was, you know, 2013, when, you know, companies like Zynga, Spotify, Facebook, these companies had just taken over the, they were the digital products, like, right, they were the companies that sort of showed us that we can have digital experiences, solving things that we used to solve in person, right? Games, listening to music, Airbnb, another example, where we used to call and book hotels, But all of a sudden, we had these really wonderful online experiences through going through a product, going through an experience, and just booking a place to stay. And this shift was really going straight. I mean, it started like 2005, 6, 7, when we started seeing all of the apps happening and like just more of the things that we needed to do as people, as humans, just it became more and more mainstream for us to be able to do that online as products, either on web or mobile. Obviously, like software development has been around for a while and we had the dot com boom.
Starting point is 00:45:58 But this sort of shift really started happening at that point in time, I would say. We were going into digital products, digital experiences. So even while I was reporting to the CFO, their biggest problem wasn't financial data. Their biggest problem was really impacting, like understanding and being able to report to the board of directors, the leading indicators for the revenue. Because revenue, I think most people have sort of agreed
Starting point is 00:46:28 to in this space right now already, revenue is a lagging indicator. It's not a leading indicator for your success. While your customer experience can act as a really important leading indicator for your key performing indicator, which is typically revenue or some sort of other way of measuring that you are bringing value to your customers and that your business will continue to be able to thrive and sustain itself. And so, I mean, that's obviously my background into this and why, you know, I thought that this was one of the most difficult things to report on was this experience analytics. So what we focus on at Avvo is we basically focus on event-based data. It's typically event data
Starting point is 00:47:13 that describe user experiences, because those are the things that really sort of are the leading indicators for companies' successes today, for digital product successes today. And I would say today, if you're a company and you don't have a strategy for your digital presence or for your digital product it's likely that you won't survive and so I think this is just a fundamental piece of business strategies for right now already and we'll see like the laggards just falling behind the train in the coming years if if they won't catch up and keep up so that's really one of the really main drivers and just also to take an experience like an example you know one of our customers like patreon they've been with us for a long time they
Starting point is 00:47:57 were one of the sort of one of the earliest sort of like one of the earliest larger organizations to take a back pat on us and we have a fantastic relationship with them they're fantastic mora and jason and everyone over there at patreon it's been fantastic to work with them and they know so many things about this space um and one of the things that mora has already shared with me she talked so much about this like obviously revenue matters to them but the performance of the user experience in the customer journey that also matters a lot to the financial teams and they already use that in their modeling to be able to sort of have some leading indicators for their
Starting point is 00:48:38 success so and then if i tie this to like how do we see this evolve for Alvo I mean there's no limit to what we can support customers in you know fully structuring their event-based data however I think the biggest challenge and touching it like tying it back to what you were saying in like change management and organizational shifts and how people build their organizations and their their sort of focus i think that is really around like the customer experience so that's a deep passion for me and in helping people sort of do that better and so i think while we haven't fully solved that problem we're going to focus on fully solving that problem i would say that's that's some great points, Steph. And yeah, I mean, I think we are going through like a major cultural change inside companies
Starting point is 00:49:29 when it comes like to how data and KPIs and all that stuff are perceived. And I think it's going to be super interesting to see how these things will change in the next couple of years. One last question from my side, then I'll hand it to Eric. You mentioned at the beginning that one of the lessons learned for you was that when it comes to data, less is more at the end.
Starting point is 00:49:52 Can you expand a little bit more on that? Because I think it's going to be super interesting. Yes. So, again, I think like what I mean by that. So I touched on it a little bit before with like when I was talking about, you know, how important it is to bring data producers and data consumers closer together and sort of bring this whole process really into the product development lifecycle. What the mistakes that I've seen people do all over, like again and again and again, is when they decide to track everything quote unquote and you
Starting point is 00:50:27 know this mandate can come from like it can come from the developer that is just building the product and they have a mandate to add some analytics for it and so they're like yeah let's just track everything it can also be something from like a data person or or or a cfo type of person that really just doesn't understand or or isn't haven't they haven't just taken the time to think through what are the fundamental steps in the journey that we really need to understand the friction point for so they give this mandate to track everything and this is just the worst track that i that i see people go through again because of the same reasons why we can't just build everything, right? So building everything, building good products means focus. It means
Starting point is 00:51:14 saying no to things. It's difficult to say no to things, but we really need to do it because if we don't, we will either never ship anything because it just takes so long and we'll never have anything ready. Or we will just do a lot of things really badly, right? They just won't work and they won't have a delightful experience. And you know what it is today in building product. It's like delightful experiences are the fundamental piece in you succeeding and bringing your product to market. So the same thing really applies to data. Like if you just try to track everything, you won't have time to do it. And you just won't, you won't, there's no such thing as tracking everything. And instead, what you'll end up with is like, you'll track random, the random things that you were able to squeeze in before you had to press the release
Starting point is 00:52:00 button. And that's really bad, then you definitely, or it's highly unlikely that you have the most important things tracked if you go that route. It's really highly unlikely. And obviously here, I'm talking about sort of custom tracking. I'm talking about manually adding events. And that brings me a little bit into the subject of like the auto tracking. And I know Rudderstack supports auto tracking. We have some fantastic products that do a lot of auto tracking like Pendo and Heap and Rudderstack and Full Story, a bunch of auto tracking products. The challenge behind this, like I often put, like I often sometimes, sometimes at least I recommend to like PMs or, or, or people who just don't have time yet to think about their data. I'm like, yeah, sure. Go ahead.
Starting point is 00:52:52 Install this. It might give you some insights and it's like a safety net. I understand that you'd rather have something than having nothing. However, it really just postpones the problem of really thinking through what is the information that you need to answer. It postpone it so that you will have to be digging through a lot of noisy data after the fact. And it's really difficult for non-experts or even just like non-developers to understand what data they are looking at, because typically the data points are sort of like they're named after some IDs that the autottracker found through the DOM. And it's a lot of work to figure out what this means. And even if you do, it's not even guaranteed that you have the tracking that you need because good analytics events,
Starting point is 00:53:35 they typically have event properties and segmentations in place that you won't have automatically. And that's that thing like designing good data structures. And so even if you go through the auto tracking process and you have your safety net, it's just, it's going to be difficult to sift through it. It's like looking for a needle in a haystack. Certainly not the way to go if you want to build yourself your analytics and sort of increase the company's trust in data and sort of their trust in themselves to be able to go into a point and click UI and just like figure out what they want to answer. And
Starting point is 00:54:13 also it's not even a guarantee. It's not even a safety net. It's not even a guarantee that you will have the information that you need. So these are my two hot takes, like tracking everything, the tracking everything perspective. So, and because this is really difficult to do, like tracking a lot and tracking everything is really difficult to do. My general recommendation is just start small. Like if you haven't started adding tracking yet, choose three events and stop there. Like just absolutely stop there. And I know you want to track everything, but absolutely stop there and i know you want to
Starting point is 00:54:46 track everything but just stop there because it's way better to to ship something and ship a small slice and be able to start building out that sort of like oh interesting oh that was fun it was fun how we were able to make a decision based on that rather than just like turning like tracking into accidentally for two sprints you didn't ship any tracking because you just decided to track too much and you didn't really have time to finish it so that that's a little bit on the sort of less is more perspective I have way many way way more thoughts on that as well but I think that sort of captures a couple of the essences of that, I think. Well, unfortunately, I think we're close to time. But also, I think we definitely wouldn't have enough time if I asked more questions.
Starting point is 00:55:34 But I have many, many more. But we can save all of that for our philosophy role-playing episode, our special philosophy role-playing episode, our special philosophy role-playing episode. I learned so much stuff. And I think the, I just think our listeners will really appreciate the lessons that you have brought us on this episode based on your vast experience. It's just really helpful. And I think there are lessons that apply to companies or data professionals that are just starting out and really large organizations. And I just always appreciate wisdom that time-tested wisdom that's sort of applicable to all of our listeners. So really, really appreciate you being on the show.
Starting point is 00:56:18 Sounds like you're doing amazing things with Avvo. So excited for you to continue there. And we'll schedule a time to have you back on the show in the next three or six months. Thank you. Thank you for having me. It was super fun. I can't wait for our role play. Yes. Okay, thank you all. Well, that was an absolutely fascinating conversation. I think one of the things that really stuck out to me was how balanced of a perspective that Steph has. And I think the statement that just really, really stuck with me was, she said this almost in passing, but she mentioned how important your sort of gut sense is in making decisions about a product. And just hearing about her background and how much work she has done in data and data
Starting point is 00:57:15 tooling and just the sort of the massive amount of understanding and capability she's developed, yet she has just seems to have such a mature intuition about about how to do things well and not necessarily just get lost in the data or put that above absolutely everything else so i just really appreciated the perspective what was what was your big takeaway costas well we should for sure need to do this role-playing with philosophers role-playing episode. That's going to be a lot of fun. Out of this, what I keep from our conversation with Steph is how important culture is when we are dealing with data. And as we said during our conversation with here, I think we are at the beginning of a tectonic move or change
Starting point is 00:58:06 what is happening inside companies in terms of how they perceive data and their value. With a very good example that Steph gave about how growth is perceived even from the CFOs and how even the CFO's perception
Starting point is 00:58:23 has changed and non-standard KPIs like retention and customer satisfaction is becoming like a leading KPI even for them. And yeah, I'm really looking forward to chat with her again in the next couple of months. Avvo is growing really fast and I'm pretty sure that like in a couple of months from now,
Starting point is 00:58:44 we will have even more to learn from her and her team. I agree. Well, thanks again for joining us on the Data Stack Show. Be sure to subscribe on your favorite podcast tool to get notified of new episodes every week. We have some really, really exciting shows coming up in the next few weeks that you'll want to make sure to listen to. Until next time, that you'll want to make sure to listen to until next time.
Starting point is 00:59:05 Catch you later.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.