Drill to Detail - Drill to Detail Ep.65 'Bootstrapping, Growth Hacking and Supermetrics Data Pipelines for Digital Marketers' with Special Guest Mikael Thuneberg

Episode Date: May 13, 2019

Mark Rittman is joined by CEO and Founder of Supermetrics, Mikael Thuneberg, to tell the story of how a mention in the official Google Analytics Blog and a prize of a t-shirt led to him founding and b...ootstrapping a €2M ARR marketing analytics business that’s probably the most important software vendor the Drill to Detail audience has never heard of, and who recently moved into the data pipelines-as-a-service market in collaboration with Google Cloud Platform and the Google BigQuery team.Announcing Supermetrics for BigQuery: Get a marketing data warehouse up and running in minutes (Supermetrics blog)Supermetrics, Google BigQuery and Data Pipelines for Digital Marketers (Rittman Analytics blog)

Transcript
Discussion (0)
Starting point is 00:00:00 So hello and welcome to Drill to Detail, the podcast series about modern cloud analytics and the people and products making the news in this industry. So I'm your host, Mark Rittman, and I'm pleased to be joined in this episode by Mikael Thunberg, CEO and founder of Supermetrics. So Mikael, thank you very much for coming on the show, and it's great to have you here. Thanks. Great being here. So Supermetrics, I've often said that you're the BI company, the biggest one people have never heard of, really, certainly within my world or the world I came from, because you kind of work really with kind of marketing users, marketing personas and so on. So maybe just tell us a little bit about who Supermetrics are
Starting point is 00:00:49 and I suppose the route you had into actually founding that company. Yeah, so many years ago in 2009, I was working at a gaming company in the marketing department. And part of my job was to create reports for various parts of the company. And a large part of my time went into gathering data for these reports. So mainly copy pasting data from sources such as Google Analytics into Excel.
Starting point is 00:01:19 And that was really, really time consuming and really boring. And I figured there's got to be a better way to do this. So I went to look for ways to automate that data gathering process so I could do my own work more efficiently. And in 2009, the Google Analytics API just came out. And I figured that there's got to be a solution for getting data out of that API into Excel. And I did some Googling and I found a forum post by a guy at Google called Nick Mihailovsky, saying that the first person to link that new API with Excel will get a free Google T-shirt.
Starting point is 00:02:01 Actually, there was one guy, a Swedish guy, who had already responded and said that they have a solution but Nick couldn't get that to work and neither could I so I figured that I need to create my own solution I was actually not a coder at all so I had some limited experience of doing Excel macros that was that was pretty much it but I decided that that can't be that difficult to get that working. So I spent a few hours coding and I made a simple VBA script that goes to that API, gets that data into Excel. And it was a pretty good solution. So I responded to Nick Michalowski on that forum and said that I have this solution here and can I get my t-shirt now and then Nick tried it out and was happy with that and he did send me the t-shirt
Starting point is 00:02:53 and Google toothbrush as well but more importantly he then blogged about the solution I had done in the Google Analytics blog and that gathered or generated a lot of interest towards my script and a lot of people from all around the world started contacting me saying that okay you have made this script that is awesome can you do something on top of that that script and I started considering like I had never really thought about becoming an entrepreneur, but the opportunity just felt really, really good. So after a few months, I left the job and I formed my own company to work on Google Analytics API solutions to start with.
Starting point is 00:03:47 And then gradually I then expanded to other data sources. Okay. One of the things that certainly when I moved into this world and from more kind of, I suppose, enterprise analytics before that, I was surprised at how big a deal Google Analytics is to marketers. I mean, it is such an important data source. And as you say, you know, it's not always been the easiest to get data out of there. I mean, so maybe just kind of paint a picture of how important Google Analytics is to people
Starting point is 00:04:15 and why getting data out, as you did, was such a kind of big deal. Well, yeah, obviously, Google Analytics is used in millions and millions of websites all over the world. And it's a really good tool. So you can analyze pretty much anything you want on your site with Google Analytics. But I think there are still limits on the reporting UI that Google provides out of the box. So for a lot of advanced use cases, you would like to get the data out of GA and put it somewhere else where you can really analyze that data yourself and combine that data also with other sources.
Starting point is 00:05:02 So there's a big market for any API integrations in the GA that make that possible. Okay. And another thing that I was surprised about was it's not always easy to get data out at the most granular level of detail. It's sampled and so on. Is that still the case now? I mean, how does sampling work with that? And again, how did your solution work in that sort of area? Yes, so with GA, we work with aggregated data. So, of course, GA does have a hit-level export into BigQuery as well for premium customers. But we work with the API, which is aggregated data. That is sufficient for the vast majority of use cases. Google does sample the data,
Starting point is 00:05:48 but there are certain ways to get around the sampling, and that's been one of our strengths on how we can avoid this data sampling that Google performs. Exactly. So I suppose the other thing really is, again, having worked with your product, and again, another reason I was interested to speak to you was it's, you know, Supermetrics is a product that I see in almost every customer site I go to, really, where they're kind of evolving from maybe being a startup. And that's the first data source they would use.
Starting point is 00:06:15 And they'd use Supermetrics to bring that in to then using your products to bring in, say, Facebook ads and so on. So maybe again, just paint a picture of what, I know there's quite a few products that you have, but what broadly is your coverage in terms of kind of data sources and I suppose this sort of jobs to be done that you support as well? Yeah, so we cover a wide variety of marketing data sources. So advertising platforms such as Facebook ads, Google ads, Bing ads, Twitter, LinkedIn, and many others. Then we have web analytics tools, GA obviously, we have Adobe. Then we have databases of various types. We have Salesforce, Stripe, PayPal, and various other sources, but mainly with a marketing focus. Okay. And who would be the typical, I suppose, persona of someone who'd use your products? Who would be the kind of person you mainly cater for or envisages your customer that you work with?
Starting point is 00:07:17 Yeah, so it's gradually evolving, but traditionally it's been, the tool has been, so if I start with where the tool has started out as. So we've been obviously focused on spreadsheets for a long time. So the company started out with getting data into Excel. But since then, we created a Google Sheets add-on that has become the main product for the company and product of which we are best known all around. So getting data into spreadsheets, which we see that a lot of marketers want to do because nothing really beats a spreadsheet when you want to analyze data and combine it in a very flexible ways.
Starting point is 00:08:03 So that's been a really, really good niche for us, getting the data in the spreadsheet. And in that market, it's usually an analyst or the marketer himself or herself that is purchasing the tool and then using the tool. Okay. So how was, I mean, if you're building a product for that kind of audience how how do you think about i mean you say you deliver it as an add-in into
Starting point is 00:08:29 i know there's quite a few products you do and it'd be quite good to talk about some of the other ones later on but when you when you're thinking i mean the way i came across super metrics was i was looking in the the g suite sort of uh marketplace and i saw it listed there amongst the other kind of analytics tools and analytics sources um you know so is that is that your primary kind of I suppose channel for selling or how do you go about you know selling and building products for marketers really? Yeah so obviously when the company started out like I said Google blogged about the solution many years ago that gave me the initial traction. And since then, Google has been really, really helpful. So we have a tight relationship with the Google Analytics, Google Data Studio, and also the
Starting point is 00:09:12 BigQuery teams nowadays. So they provide a lot of visibility for us. And that's been important for getting people to use our products, both through their social media their blogs and their application galleries but on top of that we do a lot of content and influencer marketing as well okay okay and i suppose in terms of the product it needs to be something that is i wouldn't say non-technical because i mean most marketers i i know are are competent with the tools that you use there but presumably it has to be a fairly kind of turnkey solution really and something that is focused on like immediate value I guess really. Yeah definitely so myself I'm not by background a technical person so how I try to design the product is for a person like me that is working with a lot of marketing data
Starting point is 00:10:06 uh it's not really a coder uh it's somewhat somewhat like technical skills can can do a lot of stuff in a spreadsheet for example but but uh but still not not really a really a developer um and that's always been been my aim with the products to make them simple for normal marketers to use without the help of any IT guys. Okay, okay. And you've managed to grow. I mean, I was looking on Crunchbase actually before speaking to you, and I think you're one of the most successful Finnish startups and so on.
Starting point is 00:10:39 What kind of size have you got to really, and how big is the team and that sort of thing, the number of customers and that sort of thing yeah so we are approaching 50 people in the team uh we now have offices in in finland where we are where we are we are coming from and we recently opened a development office in lithuania and we are looking into opening in the US later this year. We recently passed 10 million in annual recurring revenue. We have thousands and thousands of companies all around the world using the products. Half of them are marketing agencies
Starting point is 00:11:21 that use them to analyze the campaigns they do for their clients and optimize those campaigns. And then the other half are direct customers such as Warner Bros., Dyson. Actually, Google itself is a client as well. And many other big names. Okay. That's fantastic. I mean, so your products have evolved over time but so you started off with you say excel and it's in google sheets now and that so on i mean and then you've i think there's some support you've got for google data studio as well i mean
Starting point is 00:11:54 again you know i think google data studio is often the kind of bi tool that is the biggest one that no one's heard of within the kind of the the old school world um uh why why did you focus on on maybe to explain what Google Data Studio is and what did you do with that product, you know, to bring Supermetrics to that? Yeah, so Google Data Studio is Google's BI dashboarding tool, pretty new one. So I think they released it or made it generally available like two years ago or something. So it's a pretty new thing.
Starting point is 00:12:28 Feature-wise, it's not quite on par yet with some of the established players, but what we see is people really love the ease of use and all of the sharing
Starting point is 00:12:42 options there. So it is gaining, I would say, a lot of users and maybe at the expense of some of the existing players on the market, because it's very easy to use. And I think also one aspect has to be that it's free. So that's definitely part of it, but it's not the only thing. It's also a really, really good tool to use. And we also use it here at Supermetrics ourselves.
Starting point is 00:13:14 And yeah, so a couple of years ago, then Google, they had already released Data Studio and then they contacted us and said that, you know, we are going to open up a data studio for data integration companies like Supermetrics. So I asked if we would like to be the first company to develop connectors for bringing non-Google data into Google Data Studio. And, yeah, of course, we took that opportunity and started working with Google on that and then released that functionality in August 2017.
Starting point is 00:13:56 We were their launch partner with the connectors, along with a few other companies. But I think we had the vast majority of the connectors along with a few other companies but uh i think we had the vast majority of the connectors available and i think we still have a uh the strongest uh strongest uh suit of connectors uh for that product and okay okay so so so how i mean just i mean i know you obviously you're not you're not the technical person behind it and so on but how do that i mean in broad in broad terms how do the connectors work what do you i mean they connect via an api for example to to the various data sources and and they come back is
Starting point is 00:14:30 it via batch i mean in broad terms how does that how do your connectors work really yeah so actually i am i am the technical connectors i have to say uh so that was still, I would say, the last product where I was really, really hands-on with the coding. Obviously, now we have many more talented people in our team to do that. So Data Studio has this certain framework for connectors. They let us specify a config to show to the users what kind of selections the users need to make in order to get that connector working.
Starting point is 00:15:19 Then we tell Google Data Studio what is the schema for the connector. So we have on our side a list of fields that the connector supports. We give that to Data Studio. And then Data Studio lets users, when they do reporting, they pick fields out of that schema. And they give us a get data request that contains these fields and various other parameters, such as the date range that the user has selected. Then we make a request with these parameters to the source API, Facebook ads, for example, get the data,
Starting point is 00:15:57 then process that data in our backend into a standard format, and then return that data for Data Studio to display for the user. Presumably you've got tens of thousands or more of customers doing this at the same time. How do you handle doing this at the scale that you do it at? Yeah, it can be a challenge. Obviously many of these data sources have rate limits, quotas, various things that make it difficult to scale
Starting point is 00:16:27 these connectors when you start getting a lot of users and we have put a lot of effort on handling these massive amounts of users so we do a lot of caching on various levels to reduce the usage of the
Starting point is 00:16:44 source APIs to avoid going over these rate limits. And we also have, like, we are trying to stagger the queries so that not all of the queries are fired at the same time and things like that. And obviously, we do have a very good monitoring of our connectors in place. So if there are any issues, we can react immediately. Okay. And again, with the basic connectors and Google's Data Studio ones, how do people go about maybe trying to stitching together maybe sort of data from, say, Facebook ads
Starting point is 00:17:18 and from, I suppose, AdWords and so on? Is that something they do themselves or is there anything you can do to help with that at all to try and create a kind of, I suppose, a unified view of campaigns? So there are several ways how people can do that. Data Studio nowadays has a data blending option, so you can make one query with our Google Ads connector, one with our Facebook Ads connector,
Starting point is 00:17:40 and then use Google Data Studio's data blending option to merge that data together. That data blending option does have its limits. So what many people actually do is they use our Google Sheets connector to get all of that data into Google Sheets.
Starting point is 00:17:59 They're merged that in whatever way they want. Maybe they do other processing on that as well maybe they even have imports from sources that are not super metrics and merge that data as well and then in google data studio they use the google sheets import to get that data from google sheets into data studio okay okay um so so that that really is a good lead into i suppose the thing that got me again interested
Starting point is 00:18:25 in speaking to you which was um i i yeah i heard i noticed that there was um that actually uh your products or your connectors have been featured uh are now featured in the uh in the uh google cloud marketplace uh actually now fully fledged connectors for loading data into into bigquery which is which is another kind of stage on really from what we've been talking about. I mean, just maybe just tell us what that's all about and broadly, what's the kind of value proposition really with those connectors? So this is a very, very interesting new area for us.
Starting point is 00:18:55 So like I said, the company started from spreadsheets, then we expanded into Google Data Studio, and we have connectors for Tableau and Power BI and the other BI tools as well. But now for the first time we are also bringing this data into a data warehouse, into Google BigQuery. So we were Google's launch partner
Starting point is 00:19:18 with a new data transfer service product that they launched at Google Cloud Next just a few weeks ago in San Francisco. What this service does is that it allows for very easy import of data from various sources into Google BigQuery. So with just a few clicks, you can set up a transfer from, say, Facebook ads.
Starting point is 00:19:48 And then that transfer will run every day to get your latest data into Google BigQuery. So you can get a marketing data warehouse up and running in just a few minutes. And that's pretty cool. Okay. Okay. just a few minutes and that's that's pretty pretty cool okay okay so so the actual i mean the technology behind that is that is is it the same same technology you're using for for bringing data into say google sheets or or have you kind of hooked into a kind of google pipeline for doing this i mean is it any different at all so the underlying connectors to the source apis all of our products use the same same uh backend that has
Starting point is 00:20:25 the uh that has these high quality connectors that we have developed so that is the same uh but then obviously the technology of getting that data then into big query that that is uh completely new and it uses this new google data transfer service that they opened up now for third-party developers such as us. Okay, okay. So, I had a quick look at the marketplace again recently, and there's connectors for yourselves, there's connectors from Fivetran as well, and so on. And in a lot of cases, well, no, in some cases, there's some, not duplication, but
Starting point is 00:21:02 both companies, other companies will cover those as well and but it struck me that you're with your connectors you're maybe again serving a slightly different type of user or a slightly different type of market to to um the ones that say other companies connectors are aimed at i mean maybe just tell us about that really and how that's influenced the design of the product what What we did here, so there are a few things I would say that differentiate us from some of the other companies. Obviously, there are other companies like Fivetran who have been in the data warehousing, marketing data warehousing market for many years,
Starting point is 00:21:37 and we are like newcomers to the market. And these companies, they are good companies, and I think they serve certain purposes very well. But I think there's still an opening there for someone like us to do some things a bit differently to make things easier for a certain segment of users. So first of all, I would say the ease of setup. So when you use our connectors with BigQuery, you can handle the setup fully inside the BigQuery UI. You don't need to go to external tool to register or configure anything. You can set up everything inside BigQuery. And it literally is just a few clicks to do that.
Starting point is 00:22:21 So you can get the Facebook ads transfer running in two minutes. Then we have the really high quality connectors to all of these marketing data sources. So we are coming from the marketing world and we really know this data well, and we are known for our high quality connectors. So anyone can say they have a Facebook ads connector if they bring five metrics and split by campaign. But we really aim to get all of the data that is available in these sources. So we really aim to create high quality connectors and also update them whenever the APIs update to make all the recent new features available to our users. And then some things on how we actually bring the data into BigQuery.
Starting point is 00:23:13 So what we wanted to do here is make it really easy to do reporting on that data. And what we see many other companies do is they come maybe from a more traditional data warehousing world, and they aim for these normalized schemas. And that's not really the easiest thing for an analyst to use in a reporting tool such as Google Data Studio.
Starting point is 00:23:46 For instance, in Google Data Studio, if you use BigQuery data there, you would generally select one table for reporting and then Data Studio would show you all the columns in that table as fields. But with a normalized database schema, that doesn't really work very well because most of the data is not in one table.
Starting point is 00:24:09 It's spread out all over and you need to do joins across those tables to be able to really do good reporting. So those normal schemas are not really well suited for these BI tools, or at least not for marketers who don't know SQL. Of course, if you know how to write SQL, then you can do the joins, and then that kind of schema will work for you. But most marketers, they don't know. So what we wanted to do is we make the normal schemas, huge tables which contain a lot of data.
Starting point is 00:24:51 And then it's really easy to use that data in the reporting tools. So the marketers don't need to join between tables. They don't need to write SQL and they don't need to bother developers on helping out with reporting. Yeah, definitely. I mean, yeah, totally agree. I mean, so my experience with Data Studio was, as you say, you point it towards a specific table. Now, you could obviously pre-join dimension tables to that big table, and then you could therefore report against that. But then you hit kind of issues where you're joining to, you know, or you're bringing a lot more data in than you would expect to really.
Starting point is 00:25:26 And do you create a view for every potential combination of kind of dimensions and facts? And it gets complicated a lot quicker than you think it's going to be. And I think the other thing that struck me was I brought in Google Search Console data through your connector. And I brought it through actually Console data through your connector, and I brought it through actually a Stitch one I tried it with as well. And granted, the Stitch one was through the Singer project, so it was an open-source one that was community-supported. But it was interesting seeing the different, I suppose,
Starting point is 00:25:55 different ways that the connectors bring data in. So something like Stitch would often bring the data in, and you'd have to then go and use things like unnest commands to actually unnest nested columns into something you can kind of work with, which if you know SQL and if you know BigQuery, it just takes you two seconds to do. But it's a kind of a level of complexity that will confuse people. And then you get, say, maybe sort of like, I don't know,
Starting point is 00:26:20 the five trends of this world where they would have dealt with the unnesting for you, but it would be a normalized schema. And your one is you go for the big wide schema, really, which suits Google Data Studio. And it also suits BigQuery, which is typically, you know, best at doing aggregating and projecting queries. And that's the main thing it aims for. So it struck me it was a well kind of aligned solution with big query and it's
Starting point is 00:26:46 a well-aligned solution with the market you were aiming for really yeah definitely and as you said this is the way that google says big query should be used with the normal schemas not doing joins across tables and that's what we went for so it's both easiest to use for the on the reporting side and it's the most performant. Yeah, and I think the last thing really I noticed was the quality of, I suppose, the table metadata as well. So something, again, something that if you're in news digital marketing, or if you're even just not as absorbed in it as kind of you would be or maybe I would be, understanding the meaning of various metrics is important.
Starting point is 00:27:24 And again, you've gone quite a way haven't you i think with actually putting a lot of very meaningful and and um and useful table comments and metadata in there as well yeah yeah we really uh have tried to we don't have descriptions for all of the columns but we are really uh trying to to cover all of them uh so to make it very easy to understand the data. And I think the client feedback we have gotten so far has been really, really positive, saying that or really appreciating the way it's made it easy to work with the data.
Starting point is 00:27:56 So do you think this is going to, are you still in the end aiming at the same types of users and customers or is this kind of broadening a little bit the market you think you're going to be serving? Well, we hope that it is broadening it definitely so obviously there's part of our current client base that is doing things in google sheets that really should not be done in spreadsheets so they are really stretching the limits of a spreadsheet and previously we didn't really have a better alternative for them. But now we do. So we can say, okay, use Google Sheets, no, Google BigQuery to get the data into Google BigQuery.
Starting point is 00:28:34 And actually, one interesting thing that Google also launched at Cloud Next is connected sheets. So you can actually have the data in BigQuery, but you can still use the data in a spreadsheet in Google Sheets. So you can put the formulas and visualizations and whatever in a spreadsheet while the data is still in BigQuery. So that's pretty interesting. Interesting. Another thing that was launched was the BI Engine as well. And I think there's maybe kind of very happy confluence of things there because you've got Google Data Studio, which is the kind of first front end for bi engine and potentially bringing your data in as well i mean is is that something that you think is is interesting and potentially
Starting point is 00:29:12 could be interesting to customers to use alongside your data as well yeah definitely so what's also interesting with our solution uh for for big query is that because we are originally from the reporting side, also with BigQuery, we didn't only focus on getting the data into BigQuery. We also thought about the reporting side, and we actually created a Data Studio connector that is dedicated to working with the schemas that we import into BigQuery. So that's, I think, a pretty interesting connector. So what it does is you select BigQuery dataset, and it reads through that dataset. It sees all the tables there. Then it, because we understand that table naming convention and everything, it sees which data sources have been uploaded to that data set.
Starting point is 00:30:10 And then what it can do is, in Data Studio, when you then look at the schema, you will actually get not the column names from BigQuery, but we fetch the friendly names of these columns from our backend. So you would have nice looking names for all of the columns. Then we can set the data types for all the fields such as, okay, this is country, this is city, this is date and all of that stuff.
Starting point is 00:30:43 So this kind of metadata layer is handled for you. And even more, we can add calculated metrics to the connector. So we see, okay, in BigQuery, you have clicks and impressions in this table. So we'll add a calculated metric for click-through rate. So you don't need to define all of these various calculated metrics yourself but we bring them through the connector automatically and that's quite a big benefit because in many of these data sources you would have a really large large number of calculated metrics actually that you would not have directly in the in the
Starting point is 00:31:24 database but you would need to directly in the database, but you would need to define in the reporting side. So we do all of that for the user. And I think even more interesting for a person who doesn't write SQL is that our connector can automatically join or union between these different tables across data sources. So there are some fields that are marked as join fields. So we see that this field called campaign name, for example, is available in both Google ads and Facebook ads. So then if you select campaign name and then you select a metric
Starting point is 00:32:02 that is also available in both of these, our connector will automatically do a union query to merge that data from these different data sources together. So you get really easy cross-platform reporting in Data Studio without needing to understand SQL at all. That's fantastic. That's really good. That's really good. So, I mean, just to kind of round things off then, how would people, how would potentially interested people get hold of these connectors, your products, you know, what's the first place they'd go to to find out more details, really? Yeah, so
Starting point is 00:32:35 I think our website, so supermetrics.com, there you can find products, and there you see Supermetrics for BigQuery, if that is the product you're interested in. Okay, okay. And, I i mean you know is is it in terms of next things to be solved and other areas you're looking at is there anything else you can share about what you're looking at in the future as well in this kind of area yep so we are of course we are all the time building new connectors so that's a very obvious one, I would say. There are literally like thousands of
Starting point is 00:33:06 different marketing data sources that people are asking us to connect to. And we are prioritizing that list and going through and building these connectors all the time. So that's definitely one. We are also looking at expanding more into connectors beyond marketing, so into financial accounting data, HR, customer service, and various other data sources like that. And then in the data warehousing field, obviously now we have a product for Google BigQuery, but we are looking at Redshift and Snowflake and others as well in the future. Fantastic. That's really good. Well, Mikael, it's been fantastic speaking to you. And it's been great to speak to, I suppose, the person behind the product I see so many times in
Starting point is 00:33:56 customer sites I visit, really. And it's great news about the Connected to BigQuery as well. So thank you very much for coming on and telling us about it really and telling us a bit of the story behind you and the company. Yeah, thanks for having me. It was really nice being here. Thank you.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.