The Data Stack Show - 143: Collaborative Data Analytics on the Data Warehouse, featuring Rob Woollen & Stipo Josipovic of Sigma

Starting point is 00:00:00 Welcome to the Data Stack Show. Each week, we explore the world of data by talking to the people shaping its future. You'll learn about new data technology and trends and how data teams and processes are run at top companies. The Data Stack Show is brought to you by Rudderstack, the CDP for developers. You can learn more at rudderstack.com. Costas, I, you know, we've talked with a lot of database companies on the show, but I love talking with analytics companies just because it seems like both databases and analytics tools are so pervasive and seem to actually be proliferating at a pretty high rate.

Starting point is 00:00:46 And so today we're going to talk with Sigma, and they have a fascinating approach to analytics, sort of a spreadsheet and SQL focus. So we're going to talk with Sipo and Rob. Sipo's from the product side, Rob is CTO and co-founder. And I have so many questions. I mean, number one, spreadsheet and SQL, you know, the two fundamental tools for analytics, like why build a tool that is a spreadsheet and SQL, you know, I mean, that's going to be fascinating to hear.

Starting point is 00:01:19 But they also have an interesting story. They sort of came of age right on the heels of Snowflake, sort of out of the same group. I think that'll be really interesting to ask about. So yeah, I think those are the things that I'm going to ask about. How about you? Yeah, I definitely have many questions about how to bridge the spreadsheets and the relational paradigms together. I think that's part of the success of Sigma, that they might not

Starting point is 00:01:46 somehow like to do that. So one thing is to understand how they did it. What is left out there, because I'm sure that there is still work to be done. These are two of the foundations of working with data that usually don't overlap. Each one has its own pros and cons. There's a reason that we have both and we don't have only one, right? Yeah. And I'd love to hear both the product and the technology story behind it,

Starting point is 00:02:22 because there's very different challenges there, right? And I think we have the right people to do that. It's one of these also rare occasions where we have both product and engineering representation at the same time, so we can ask the same question from a different angle and see the differences in perspective. So we're definitely going to spend a lot of time on that. All right. Well, let's dig in. Let's do it.

Starting point is 00:02:54 Steepo, Rob, welcome to the Data Stack Show. We have a million questions, so very excited to get to it. But thanks for giving us some of your time. Thank you. Happy to be here. All right, Steepo, let's start with you. Give us your background. Tell us how you got into data and then ended up at Sigma. Sure. So I guess my career has been a little bit of, you know, at the time felt a little bit like jumping through Goldilocks stepping stones of

Starting point is 00:03:20 random jobs. First, starting with working in public policy, doing data analysis, to which I had a pretty strong reaction, jumping where, you know, if you've ever worked in government, you'll know how much technology is involved. And for me, it was not enough. So I made a pretty hard pivot to the next Goldilocks bowl. And I jumped into full stack engineering where I worked on, somehow ended up inadvertently working and building a internal analytics app for the company I was working for. So inadvertently to ANBI without knowing what AN After that, you know, I think my next journey was in my next step and my realization was that where I really wanted to be is not just in the processing data. And it wasn't just really sort of building the tools, the infrastructure to work with data, but it was in

Starting point is 00:04:17 decision-making itself. And so that's when I made a cut over to, to actual BI management here in San Francisco with a company called PagerDuty. So as a practitioner, then a little bit into my journey, a tiny little startup decided to reach out to me to ask what I thought about their product. And this company was Sigma. And I was the second person outside of the company to see the product. And from that first time that I saw it, they might have oversold. And so clearly, I joined as a sales engineer and transitioned over to the first product manager. And I've been doing this for about five years now. Awesome.

Starting point is 00:05:00 Okay, Rob, I have to know, were you part of the group that got early feedback from SIPO and PagerDuty? No comment. We use the word oversold, so I had to ask about that. But Rob, give us your background. Absolutely. So let's see, I've spent now, gosh, more than 25 years in enterprise software. And I've always built things that in some ways they're not finished products. They're things that other people take and build on top of or do something to build their own products with.

Starting point is 00:05:36 So I actually started my career working in the Unix kernel. I thought that was like the coolest job. I was, you know, the technology nerd and loved Unix. It ends up it really wasn't actually that interesting because it was always a lot of just like people just expect kernels to work. There's not like a lot of interesting features they want. And so I remember in like the mid 90s, I was like looking at a bus signal and trying to debug this, you know, crash. And all my friends were working on like these cool new dot com startups. And I was like, I want to go do something that sounds a lot more fun like that. So I moved up to San Francisco and I joined this company, BEA, that was building a product WebLogic. And the whole idea was really try to build kind of an operating system for the web and really sort of give developers a platform and Java at the time to build out.

Starting point is 00:06:26 I ended up spending nine years there, building that out, taking it out to be a pretty successful business. We were a multi-billion dollar business. And in about 2007, I got really excited about this idea of cloud computing. I remember talking to some friends at Salesforce and even just the idea that, you know, I left a company where I think we had 40 versions of my on-prem, you know, stack that was maintaining. And they were like, we have only one version. And that was about enough. I was like, once I heard that, I was like, I'm moving a few blocks. I'm going to work there. So I spent seven years at Salesforce building out a cloud platform for them. I spent a lot of time

Starting point is 00:07:06 there trying to think about, again, what would you need to build out a product on top of a cloud stack? In Salesforce's case, it was both, you know, their internal products, like their CRM, as well as things that third parties would try to build. And I left Salesforce then in about beginning of 2014 and joined a venture capital firm, Sutter Hill. And I left Salesforce then in about the beginning of 2014 and joined a venture capital firm, Sutter Hill. And I joined there because of a few things. One is I was crazy enough to want to go start a company. But the second is I really liked that what makes Sutter Hill different was that instead

Starting point is 00:07:38 of just like a normal VC firm trying to invest in kind of, you know, interesting pitches they see, they actually tried to start companies from scratch. And so I watched them do that very successfully. And I went and made that leap. And as part of that, they helped us found Sigma. And so we founded the company in 2014. And I've been our CTO and obviously co-founder ever since. Very cool. And could you just give us the quick, what is Sigma? Tell us what Sigma is. Absolutely. So the way I think about it is we started from this idea that people were going to move to cloud data warehouses. So we were early on seeing this transition to things like Snowflake or Google's BigQuery or Amazon Redshift. And we thought about those environments and said, what is the interface that you'd want to build on top of one of these

Starting point is 00:08:30 things? And in particular, like if you wanted to say, hey, I want everyone in my company to leverage this like massively scalable repository of data, what is the interface you'd want? And so what we built is something that lets multiple people work together, collaborate, build out very complex analysis. And the interface we chose to sort of base our product on is the idea of a spreadsheet. And so what you do in Sigma is you can build these collaborative workbooks where, say, Steepo and I could build a bunch of analysis together. We could interact live and share things back and forth. And we can write formulas. We can, you know, just like you would in sort of any of these sort of traditional spreadsheets.

Starting point is 00:09:11 But the magic of it is that everything you do in the product actually gets translated to SQL, executed against that scalable data warehouse. So you're combining all the sort of benefits of a traditional kind of centralized, like technical data warehouse with something that's actually broadly accessible and very powerful. So that was the original kind of, you know, idea and pitch and what we built out. Love it. Well, I want to return to the founding. But before we do that, Stipa, I want to go back to part of your story. So you built sort of homegrown analytics products inside of a company as sort of one of your first forays into full stack engineering. And I've never done that personally, but knowing some people who've worked on that stuff and knowing that there are companies like Sigma who literally build big companies around this.

Starting point is 00:10:06 Did you have any takeaways from that? Was there ever a point at which you said, maybe we should have bought something instead of trying to build this homegrown? Maybe I have the use case wrong. That seems to be sort of a formative experience. And so I'd love to hear about the lessons that you have taken away from that and sort of influenced the way that you think about building analytics products? I love this question so much. I've never, I never regretted that decision. And even still, you know, knowing where the market is, I still, I, you know, if I could go back in time with the same market, I, barring Sigma, you know, I would make the same decision again.

Starting point is 00:10:48 The reality is that the way that people work with data right now and with traditional BI is so limited. And the behaviors and what we needed at the time at the company that I was at was just, there's no tool that could easily support it. And in reality, it wasn't just choosing a tool. It was choosing an entire sort of infrastructure for how I needed to prepare my data to be able to work with it. So there's these like layers and layers. It wasn't just the top level of how do users interact with this. And I, at the time, as an engineer was just trying to understand how do these layers interact? What are these models that I have to think through? And if I'm having a hard time understanding how to work with this data, and I'm supposed to be the technical person doing this work, how the heck is anyone

Starting point is 00:11:34 else supposed to work with this? And so, you know, I think it's influenced a lot in how I think about the end state of where we need to go. But I also feel that all of my time working with traditional BI tools just as much has influenced that too. Love it. Rob, let's go back to, you gave us a little bit of the founding story, but Snowfl know, sort of one of the founding stories out of Sutter Hill and was in pretty close timeline wise to Sigma, right? I mean, you're talking maybe a couple of years there. What I would love for you to dig into just a little bit relative to the founding story is an interface on top of a data store is a very, that's not a new concept, right? And even when you started Sigma, that was in the enterprise, at least sort of a prevailing paradigm of how you did, you know,

Starting point is 00:12:32 business intelligence, right? It's like, okay, we have a data store and we have some sort of layer on top of that, you know, whatever. What about the existing paradigm that, you know, from a basic architectural standpoint, data store interface, what about that wasn't working? And like, what made you, I mean, really, in many ways, it kind of sounds like you're saying like, the data store is completely different. And so we need to think about the interface completely differently. Is that a good summary of the way that you were thinking about it? I did there and then stole my answer. Yeah. But I will give you a bit more color as you'd imagine.

Starting point is 00:13:11 So you're right. We were, in fact, the Snowflake team had just left Sutter Hill maybe six months to a year before I joined. My co-founder, Jason, was actually advising at Snowflake during that first year, helping them build out their optimizer. And I remember I was at Sutter Hill when Snowflake came in, I think for their Series B. They did their pitch to sort of, you know, follow up with their investors and show off the product. And I vividly remember the presentation because it was, they were showing off at the time how you could elastically scale you could drag a slider and have it you know have more CPUs or less CPUs and obviously the queries run faster or slower and from you know honestly from the podcast right now people are gonna be like yeah that just sounds like what you do right but if you rewind your

Starting point is 00:14:01 mind back to like 2014 like what people used to have to do, and sometimes I, you know, feel like the old person who has to tell people like back in my day, but back in my day, you had to go like rack servers, right? So, you know, if you wanted eight CPUs, you were like dragging stuff around and putting cables in or most likely requisitioning someone from IT to go do it. So as a technology perspective, I was sitting there like this was mind-blowing. But when I watched them demo, they literally had pages and pages of these 700-line SQL queries. And so that was where I think a lot of the status quo has been in this environment. Like the interface that people have traditionally built on top of warehouses is designed for a very small number of people.

Starting point is 00:14:52 And it's designed for a very small number of people on purpose. If you had a traditional, that traditional warehouse where it was very painful to expand it, then you'd be very protective of it. You would say like, I do not want a lot of people running queries on this. The story I'll tell you is even when I was at Salesforce back in the 2000s, we had, as you'd imagine, we had an on-premise data warehouse. And at the end of the quarter, they would have to send out things saying, please stop refreshing your reports. You know, the warehouse is getting overwhelmed.

Starting point is 00:15:24 That was just the reality of how these systems worked. When you step back and say, I have something where I can actually scale it out, I can partition resources, I can make all these decisions about how I want to manage the workloads. That's when I think you really start thinking about how do I want to redesign the interface? And that was the fundamental sort of technology that inspired us to start that whole journey from over again. We were, of course, fortunate that we were there to see these. We were at the same VC firm. We saw the early work. We knew the founding team there. And so we had sort of an early bird's eye view into that change.

Starting point is 00:16:00 Sure. Well, I mean, that's the wonderful thing about Silicon Valley, right? Is that, you know, technologies can sort of emerge together. So Rob, that's super interesting. You know, at the time, you know, 2014, so almost a decade ago, some people might have said like, well, sort of the insights that you need, you can run queries on top of a data store, and you can visualize it. Great. Fast forward 10 years, maybe it's even more of a solved problem now. But the market would actually suggest differently. In many ways, there are new analytics, paradigms and technologies coming out that would, in some ways, suggest like, well,

Starting point is 00:16:49 maybe we're sort of in early innings in terms of what's actually possible with analytics. So I guess this is a question for both of you. Sipo, maybe I'll start with you. Are both of those things true? Like, is BI over? Are we in the early innings of analytics? Like, give us your perspective. I guess I'd start with by saying that the BI of old is dead. And I think there's a new era. We're in the early innings of a new era. And what I mean by this is certainly we are, I think the BI tools of old did provide more access to the centralized data stores and all the power that's there and then the size of the data that's there, right? I mean, crucially, that's sort of the most important piece of what they offer, which

Starting point is 00:17:33 is at the high trade-offs that they require, which is users can't work with data that they, the way they're taught to work with data, the way they think about data, instead, they have to be limited in the ways that these tools provide. So I think that model, I think, is what's really up, is being challenged. And certainly, it's what we are challenging. The core of this is really about going back to the basics and the fundamentals rather than giving tools to people and telling them to learn the tools to answer the questions that they have the new era of bi is to give people the tools that meet them where they are right that remove

Starting point is 00:18:17 the friction because being able to answer the question is almost the least interesting thing that they can do but the most interesting thing is asking the question and being able to get to the answer. And so, and when it comes to, and being able to do that with massive amounts of data, spreadsheets were able to do that and are able to do that on small amounts of data. So, you know, there are many things that we're doing as a product right now that's trying to do this.

Starting point is 00:18:43 For example, what we've just released, which is input tables. And it's almost unremarkable in how I'm going to describe it because it also seems just so absolutely obvious. And yet it's something that BI has not been in a position to be able to provide and technology hasn't been able to provide. And input tables are very simply UI managed tables

Starting point is 00:19:02 where users can add their own data, integrate it into their analysis, join it, and then modify that data, iterate on it. And it opens up a massive world of use cases that historically BI wouldn't be able to solve. And now these use cases that were once either in spreadsheets, offline, decentralized, all the bad things about working with data in the modern times, or they would happen in centralized, specialized tools that, again, even fewer people are able to

Starting point is 00:19:31 work with and use. And so now there's this middle ground, right? And that is the platform that we are building, which is it gives you the flexibility to work with that data, restablishes that relationship that we as humans natively have with data, which is we want to be able to manipulate it. No one wants to have to think about, I want to ask a what if, and so let me figure out a feature and take 20 steps to be able to do that. Instead, what they want to do is engage with that data and modify it. And just by nature of doing that, you can forecast, right? You can analyze data that doesn't exist because, you know, the centralized data stores are not all the data that exists, right?

Starting point is 00:20:10 There's so much human context with everything that happens. And really, so what, and then on top of that, once you capture that context, there's the question of what now, right? At a certain point, when you're able to capture data through your platform, you can capture decisions. At a certain point, when you're able to capture data through your platform, you can capture decisions, which is incredible because up until now, the BI of old was the end of the pipe. You perform your analysis, you get to your insights, and then you go bugger off and try and take action on that. And it all happens in this ephemeral kind of organizational gray area that's never captured. It's very difficult to follow through on. There isn't a common understanding. And now you have this data.

Starting point is 00:20:52 Now you're capturing this exactly where you're performing your analysis. And you can capture these decisions. You can learn from them. You know what you did last quarter. You know what was effective and what wasn't. And so you have this sort of compounding effect of what analytics and BI can provide in this new world. And then on top of that, if you're already capturing your decisions, why can't you take action? Why can't instead of, you know, capturing your decision that you're going to go to X, Y, and Z place to update X, Y, and Z thing, why can't you just do it in a single platform? So when I think about going back to your question of where are we in the state of BI?

Starting point is 00:21:31 And certainly the old BI is dead. But where the future is very much about maximizing your data and being able to work with it the way that you think about working with it. Yeah. So this is just a quick, nerdy personal question on input tables. But if I understand correctly, you know, you essentially have the spreadsheet interface, but it's, you know, it's actually a table that lives in the warehouse. And you can sort of augment that. And you're interacting with that data.

Starting point is 00:22:05 And so will it actually, you know, sort of if you, whatever, if I'm running a calculation or, you know, something of that nature, will it actually push that back down into the warehouse? It's your data.

Starting point is 00:22:19 And I think this is, you know, going back to, you know, the very beginning of my, you know, to your question about the decision to build a custom app, one of the things I mentioned is just all the layers that you had to really buy into on top of the BI tool or analytics tool itself. So we sort of sit directly on top of your warehouse, and your data is your own. So input tables will write to your warehouse. We are just a simple, clean layer on top that allow you to maximize the value of what you have.

Starting point is 00:22:54 And we also, this is a pivotal moment where we're allowing you to deal with the challenges of having a centralized data store, which is very simply that everything that you need isn't always there. So yes, that's exactly the point. And so what this means is you can add your data, it gets written to your warehouse, and you then also have it available to use it anywhere else you'd need as well. We're not locking you down.

Starting point is 00:23:18 We don't have, like, we're not going to ingest your data and hold on to it. It's your data, and it's there in the warehouse for you to be able to work and join it against all the other live data that you have. Yeah. Yeah. I mean, that's incredible because I think that one of the amazing ironies of doing analytics, even in modern forward-thinking companies, it's like, okay, we're running Snowflake, we're centralizing our data and all this sort of stuff. I mean, name a company where maybe it's Sigma customers who don't do this, but where there's someone in marketing, someone in

Starting point is 00:23:58 revenue operations or something, they ask the analyst to produce a view in Snowflake and export it so that they can work on it in a spreadsheet, right? I mean, it's pervasively common, right? That is unbelievably ironic, right? It's like an anti-pattern, you know, relative to some of the promise of like centralizing your data and querying it, right? It's like, wow, we did that literally

Starting point is 00:24:20 just to like get it back out into a spreadsheet. Yeah, I think there's an old joke that BI tools had three buttons that were by far the most popular. Okay, cancel, and download to Excel. Absolutely, absolutely. Well, Rob, that actually leads me to another question. So the reason that that is one of the most popular buttons is because the spreadsheet is,

Starting point is 00:24:50 you know, an interface. Many people say, no one's ever going to kill the spreadsheet. Maybe it gets reinvented, right? But I think to your point, Sipo, it's how people ask questions. I mean, visualization, you know, that problem

Starting point is 00:25:03 was solved a long time ago. But to your point, the problem is that in order to understand what you need to visualize, you have to actually work with solved a long time ago. But to your point, the problem is that in order to understand what you need to visualize, you have to actually work with the data and ask a bunch of questions. And then like, okay, once I get there, I can visualize it. SQL is similar to the spreadsheet in that, you know, many people say, you know, no one's ever going to kill SQL. Can you talk about the thinking behind sort of combining the two most common analytical interfaces into sort of this new interface that you've built for modern data stores? I mean, to me, that's fascinating and that's brilliant.

Starting point is 00:25:39 Because to your point, Sipo, you're meeting people where they are. But can you talk about the way that you... Were those in mind when you thought about creating a new interface or was that a conclusion that you came to after, you know, doing research and talking with potential users? I'll admit to you that it was not our first idea. It took several rounds of failing, like many startups, of figuring out what was the right way to solve the problem. So we've always been trying to solve the same problem. We've always looked at it as, I have this incredible advance in cloud infrastructure.

Starting point is 00:26:14 How would I build the right interface for it? I think there was actually a big moment in sort of, like any of these things, it was a big moment that took several months to realize it was a big moment. But a lot of our first efforts, I actually think, followed a similar paradigm to what many people do with dashboards, which is they build something that they think other people want. And they basically, at some level, even though we don't always want to admit this as technologists, we think, like, I'm a little smarter than the person who's going to use this. And so I'm going to give them only very simple things for them to do, but I'm going to take care of, you know, the hard part. And the irony in all of this is like, we all step back and say, look, like, I don't really know a lot about whatever department is consuming this, but in our minds, we're still like, no, you know, I need to be the one to tell them what they need

Starting point is 00:27:03 to analyze as person X. So I think that was one of the real sort of big realizations when we went and looked at like spreadsheet users and what they actually built. Most of the time I was like, oh my God, that is like this crazily complex thing that I had no idea of, you know, I don't know anything about what these people are talking about. And I look at it and from a, you know, math or statistics or, you know, analytics perspective, I can follow it, but like the domain knowledge I don't have, I would not have known to do this analysis. And so that was a big pivot in our interface design because we stepped back and said, I want to build an interface that you can build the equivalent of any SQL query without writing code. And if you think about that challenge,

Starting point is 00:27:47 that's very different than starting and saying like, okay, I just want to make it so that, you know, filter or, you know, change the dropdown. You instead say, I have to build an interface that's really powerful. And that's what makes it so hard is because honestly, it's very easy to say, I'm going to build an interface that's, you know,

Starting point is 00:28:04 as powerful as SQL, but as simple as a spreadsheet. It's very hard to actually make that possible. It's taken us many iterations. We actually redid our core interface even as recently as 2021. It's, again, something that you're going to iterate on with customers. You're going to learn so much. And it's fundamentally a hard problem to solve. On the plus side, it's the kind of problem I love solving because it's exciting technology to work on.

Starting point is 00:28:32 And you can sort of incrementally discover things every year and make it better. One thing that I would add on top of all of this is we're not necessarily choosing. Those two things are not necessarily at odds because there are people who love writing SQL and there are people who love spreadsheets and operate through spreadsheets. And what's really neat about our platform is the ability for both of those people to be able to work together and also build on top of each other's work. And so I think that's one of the most critical aspects of being able to have a platform for all. And it is, we're not compromising, we're empowering everyone. And the

Starting point is 00:29:12 fact that I can write SQL and hand something off to you, and you can then build on top of this, whatever analysis you want, that transition is incredibly powerful because it brings, there's wisdom that both parties bring. Certainly the people who tend to be to be more business oriented to have the knowledge and the context of the data and the people who are technically oriented to have more knowledge of the data structures, et cetera. And for those two to be able to collaborate in solving a problem also in real time, I think is just absolutely game changing. And I think that's, you know, more and more is going to become table stakes for what this industry needs to deliver. Yeah, I think working together and not over Jira tickets is just like, that is a fundamental change.

Starting point is 00:29:51 I hear great rejoicing among the audience. Steve-O, I have a question. You mentioned a couple of times about, let's say, the BI of the past, or how BI was done until today, or a couple of years ago before, let's say, Sigma started. Can you give us a little bit more context on that? What does it mean to do BI in the old way? How does it look like? What's the process and what's the experience that the user has? Absolutely. So the BI of the past is a lot of process. It is a lot of time and a lot of heartache. And I say that in the sense that at a certain point, when you have a reasonable question,

Starting point is 00:30:40 and if you're a curious person who's trying to do their best to just to do their work, to answer their question, to accomplish something, you would be stuck in this at this fork in the road where you make this. You're dependent on other people and everyone is dependent on other people. You're dependent on a bottleneck to be able to get to what you need. And so you have to make a decision of. Do I wait for my data or you have to make a decision of, do I wait for my data or do I just make a decision now? And the cost here is going to be two, three weeks of my time.

Starting point is 00:31:11 So from the user perspective and from the business perspective, what you end up with is a lot of suboptimal issues. You in the, and suboptimal results really, where there's no great path for you to success, despite trying to do everything you can. Now, from the kind of the entire process side of it, what is actually this flow?

Starting point is 00:31:35 It is you have people who are answered in order to make it worthwhile to invest in getting the right data in the right format. And then it is to build out what that person needs. Then it's to go to them and vet with them and validate that question is actually answered. And if it's not, you then have to go back to the drawing board because you've messed up somewhere, or rather, there was some sort of communication gap between the two parties. And you have to go rectify that and work through that process with the data. And now, it could be that, and this is in, you know, I say this agnostically without calling out a particular technology because this is just the standard, like, way to interact and way to engage. You know, it could be that all of your data is just in the warehouse. It could be that it's a tool that requires ingestion. It could be like that, you know,

Starting point is 00:32:48 you have different levers and different amount of mechanisms to be able to interact with the data in the end, but the story is always the same. And it's just small little optimizations on that. And so when I say, you know, BI of old, that is really what I sort of talk about and what I think about. And, you know, I think it's that it's where this really lands you is in a world of uninteresting questions being answered asked and answered because it's far too much work to be able to ask the the interesting ones because you need to be able to iterate and that you know that data those questions those answers are going to come when you're working with the data directly it It's not going to happen in three, you know, three-week cycle times to get one tiny little bit of obvious kind of feedback back.

Starting point is 00:33:32 Yeah, that makes sense. And I have a question actually for both of you, but from a different angle. One angle is going to be more of the product and the user experience and the other a little bit more technical for you, Rob. So we are talking here about Spreadsheet, which is like a very, I would say, like word defined way of working with data. It's almost like a paradigm of like computing or like writing code in a way, right? And then we have the relational model, right? The one that the data warehouses out there are using like Snowflake, BigQuery, et cetera, et cetera.

Starting point is 00:34:12 Now, these two, although let's say they have the same expressivity in a way, they are not exactly equivalent. Like they have their pros and cons. They do some things better. Some other things like are harder to do. Yeah, you can do everything if you want at the end, but it's not going to be as easy. And you mentioned earlier that one of the benefits of working with something like Sigma

Starting point is 00:34:38 is that, yeah, we have spreadsheets there. You can use this environment. But at the same time, if you prefer to write SQL, you can write SQL. We want to take people and collaborate regardless of what tools they want to use. So how is this achieved? Because personally, as someone who has tried to do similar things,

Starting point is 00:35:01 to take different users and put them together, work on the same platforms, it's a really hard problem, right? So, let's start with Steep on the products side of things, and the user experience, right? How you can deliver, let's

Starting point is 00:35:20 say, a consistent experience at the end, regardless of like the person there, and let's say the language that they are using. And then I'll come to your the end, regardless of like the person there and let's say the language that they are using. And then I'll come to you, Rob, and ask more of like, let's say what happens behind the scenes, right? How we can translate one thing to the other. So one of the most interesting things to me about Sigma is the model through which you

Starting point is 00:35:41 interact with data. Now, what I mean by this, and I'll also take this back to noting, when Sigma oversold, what really oversold me was when I saw what the product was at the time. And at the time, all it was was a table. It was a table with pivots. Now it is the smallest, really small little Lego of what our product offers. But it was just a table on which you can perform pivots and add formulas. Now, the reason that was so that that hooked me so quickly and so easily and to such an extent was because this model isn't what I'll go back to the traditional BI tools offer. I think what you typically get in a BI tool

Starting point is 00:36:26 is the concept of measures and dimensions, right? The measures of dimensions, it's like it's aggregates, it's group bys, right? It's very easy to translate dimensions and measures to the way that databases work. And I think there's the, what Sigma offers and the interface and the paradigm shift is that instead of

Starting point is 00:36:47 working through dimensions and measures while being on the warehouse, you're working with data and then building on top of that, right? So in the same way you open up a spreadsheet, someone sends you some data, you want to perform some calculations, you'll open up that data and then you just, you build your formulas, your aggregations. If you need to, you can work and transform the data. You don't think about, okay, like what is the combinations of columns that I want? It is, you have your data and then you have the playground to be able to interact with

Starting point is 00:37:16 it any way or any way, shape or form that you want. Now that is, I think that's such a, it's a, it's almost a subtle or an implicit shift. And when someone sees our product, they understand it. And I think I certainly did. But it's a remarkable shift in the way that you think about data at scale. And so to be able to relate to the spreadsheet users, that's what makes Sigma incredible. On the SQL side, I think the interesting piece is because, you know,

Starting point is 00:37:51 you've got SQL users who are very familiar with writing SQL and you want those users to be able to work with the spreadsheet users. I think the beauty of what the product offers

Starting point is 00:37:59 is a single common language, which is we translate that, the spreadsheet to SQL behind the scenes. We have, you know, translate that, the spreadsheet to SQL behind the scenes. We have, you know, an incredible, incredible team here that's built a compiler that will take whatever you do through our UI, translate it into performance SQL. And then by virtue of having that common denominator, you can join these data sources together. You can build on top of each other's and you can build on top of and reference each other's work. So the way it's kind of hard to say, in my opinion, that, you know,

Starting point is 00:38:33 to talk about this as a almost as an intended thing to have been built, it's almost like these worlds sort of naturally come together and they overlap. And what comes together is sort of this beautiful synergy. Yeah, that makes a lot of sense. And Rob, same question from a different angle to you. I remember, and I think like everyone who has used both SQL and something like Spreadsheet knows that doing, for example, pivots on a Spreadsheet is very easy, let's say, right? Doing that in a database system is not the most straightforward thing to do. And in a similar way, but opposite,

Starting point is 00:39:14 joining data is something very natural to do using a relational model. But joining data on a spreadsheet is not exactly straightforward, right? So I'm giving two very obvious examples of the differences between the two paradigms. How, from a technical perspective, do you bridge these differences? And how do you create a model that can deliver, let's say, the best from both worlds at the end? Right. I think the best of both worlds is a great way to think about it.

Starting point is 00:39:48 You know, if you step back and look at a spreadsheet, no one ever talks about a spreadsheet in these terms, but if you think about it, it is essentially functional reactive programming, right? And there's no spreadsheet reader who sits down and says, I'm ready to do some functional reactive programming. That's fundamentally what you're doing, right? If you're trying to explain to someone who had never, ever seen a spreadsheet before,

Starting point is 00:40:09 you would say, there's this amazing thing. You change this value here, and everything that depends on it updates. And that core idea, I think, really sort of is pervasive throughout a lot of how we think about the interface and also the technology. One of the fundamental things we built is,'s true for, you know, columns and formulas, but it's also true even as you combine different things together in a document, what we call a workbook. So you might build, you know, a table element where you're going to do a bunch of transformations.

Starting point is 00:40:39 You might build visualizations on top of that. You may have other table elements that you're linked to. And fundamentally, you have this directed graph of here are all the sources that come in, the different calculations, their dependencies. And the way I like to think about our technology is we've built essentially, even though we don't market this way, the technology we've built is essentially a data warehouse. What's different about it is instead of it being backed, normally you back a data warehouse by disk, we are backed by another data warehouse. And so that means we do everything from, you know, we do local execution. We'll actually do

Starting point is 00:41:14 execution in your browser. If we have a subset of the data available and we can do execution there, we'll do execution there. We do optimization. So we have to do things like, hey, you've got, you know, multiple different of these visualizations and dependent things. We're actually going to, you know, optimize that and only issue one query to cover all of these things. We have a lot of the elements of sort of a traditional, you know, database or data warehouse. The back end of it happens to be that we're going to, as the last step, translate our representation now down to a SQL representation, instead of reading blocks off a disk, we're essentially reading from a backing data warehouse. So that's kind of from a technical perspective. And then to your point, there are different aspects of this you think about in the interface. And you're thinking about design in the product and how you want to expose things. So I'll give you

Starting point is 00:42:05 one example because you raised it of joins are like a very traditional part of the relational model. When you go to an Excel user, you say the word join, they may or may not even know what you're talking about. When you say VLOOKUP, they light up and they're like, yes, VLOOKUP is my thing.

Starting point is 00:42:22 And so we actually intentionally have both concepts in our interface. You can write a formula, which is a lookup, and is very close to a VLOOKUP in Excel. It is, hey, I want to grab some information from somewhere else. And it doesn't change cardinality. So it's a very safe thing to do. You're not going to change your results or twist things around. Whereas you can also go into the interface you know, twist things around. Whereas you can

Starting point is 00:42:45 also go into the interface and say, you know, instead of writing a formula, I want to explicitly do a left join, for example, right? And that is where a more, you know, sophisticated or technical or just data aware user is going to perform that operation. But we treat them separately explicitly because they tend to be separate audiences or separate use cases. And so there's cases like that where we try to embrace both. And there's cases where we very explicitly take a concept like this reactive and apply it everywhere throughout our product. That's super interesting. And one of the innovations, let's say, that they had that made it also quite popular was something similar to what you say about having, let's say, more technical persons that can go and write SQL. In their case, they had LookML.

Starting point is 00:43:36 So there was this language there that allowed you to do some kind of modeling, let's say, that then can be used and consumed by the business user and work in their more abstracted environment. And it was super successful, right? Creating these clear boundaries between, let's say, the engineering persona and the business persona, it seems that it is important. How is this done in Sigma? We have, let's say, a very business-driven paradigm, which is the spreadsheet,

Starting point is 00:44:14 and we can also write SQL. But how do you help the engineer, let's say, to do the modeling, to do all the things that they have to do for their job before the business user starts consuming the data. Right. I think about the world and really sort of two big camps of work I have to do. I have the things that are centralized and governed and known. And that known part is a real big, important part of it. And then I have things that are ad hoc. We are discovering, we are still trying to figure out, we are iterating on. And so either of these two extremes is bad. If you sit in an interface and you say, hey, you can only model things and look ML and the business

Starting point is 00:44:58 user can't change anything, it is a terrible environment, right? Any change you want, you have to go back to some centralized person and say, hey, can you change this? And to be honest, you don't really know what you want changed, right? So you end up just sort of iterating on the centralized model and just getting things wrong constantly. And both sides are frustrated. The other extreme we all know is also wrong, right? If you just say, do whatever you want in Excel, I don't care. I'm sure you'll get it right.

Starting point is 00:45:20 We all know the answer is you probably won't get it right, right? So that extreme was also wrong. So what we try to espouse in Sigma is to say, look, there are a set of known things, and that set of known things is going to change over time, right? We're going to learn more and more about what other metrics we want to govern, what other dimensions, tables, all that things. Once you sort of have a known set of governed things, those are the things that you push down into version control. We've seen a rise of DBT. And so, you know, we see a lot of people say, I'm going to use DBT because I want something that is, you know, open source and not a proprietary way to model things. But they're going to use that for sort of their centralized modeling layer. They're going to expose those models into Sigma.

Starting point is 00:46:00 And then the power of what we try to provide for people on top of that is to say, there's going to be a million things that people don't even know yet they want to ask about. We want to give them the power to figure those things out. And we may then push things back down. And, you know, part of this, as Steve was talking about earlier, this benefit of having everyone the same plot set is that the, you know, the data engineer can actually see what's going on. What queries are they running? What are they analyzing? And they can actually say, oh, I'm going to pull that down, and now I'm going to make something and govern it and pull it into, say, like a DBT model. That was never really possible before, right?

Starting point is 00:46:34 Once someone downloaded to Excel, you were like, I don't know what they're doing. Hopefully it doesn't show up in the board deck. It probably will. That's kind of the problem we're trying to avoid. We're trying to keep people on a central governed system of data and let them bring things into that

Starting point is 00:46:50 governance over time. Yeah, yeah, 100%. Actually, that's very interesting what you mentioned about the data engineer also taking a look into generated SQL and all that stuff. And I have a question on that because I've seen

Starting point is 00:47:05 and it was like with Looker, but when you have systems that generate SQL, like automatically generate SQL, you very easily end up with code that has been generated that's pretty hard

Starting point is 00:47:21 both to debug, to optimize, and also to understand in some cases. There are bugs also in these systems. And it is a hard problem to create code that's ideal for the machine, but also easy to interpret from the human side. So how do you do that? I mean, and what do you learn by trying to do that?

Starting point is 00:47:48 Because I'm sure you've done a lot and there's probably still a lot to be done and it's a very interesting area of research and development in general. Yeah. So, I mean, to your point, it is a journey. It is a hard problem. Obviously, there are trade-offs. And, you mean, to your point, it is a journey. It is a hard problem. Obviously, there are trade-offs.

Starting point is 00:48:07 And, you know, for instance, you may want to write SQL that is going to optimally perform, but may be then harder for a human to understand. And so making those trade-offs. A lot of what we've tried to do is provide tooling at sort of the higher level. You can understand just even in our interface, hey, what are we running? What generated this query? So you can map those things.

Starting point is 00:48:31 We also try to push all that down into the warehouse itself. So it sounds like such a simple thing, but actually even just tagging every query with a comment in the query text of what actually generated the query. So one of the things that I feel passionate about is trying to, you know, different audiences are going to choose different tools for how they want to discover these things, right?

Starting point is 00:48:52 So some part of the audience may say like, hey, I only want to use, you know, the warehouse's query plans to look at what's actually going on. For, you know, someone who's writing a spreadsheet formula, that is not going to be a good interface, right? They're going to want a much higher level sort of understanding. So trying to show things in the right language for the right user, but offer different choices. Because the reality is,

Starting point is 00:49:16 it's often going to be a team effort for things that are complex or working at scale. Yeah, it makes a lot of sense. Okay, one question, Rob, about you more as a founder, like someone who starts something from zero and takes it to one before you keep scaling, right? You make the decision at some point to use something like a spreadsheet, right? Like as your interface with a user out there. And spreadsheet is, okay, obviously something that it's like very well adopted, very well understood by many like users out there.

Starting point is 00:49:54 But at the same time, there are like very well established vendors, right? You have Microsoft in one side with Excel, which is huge, right? And you also have Google on the other side with a free version of Google Sheets. And you are a new startup, right? And you have these two behemoths

Starting point is 00:50:17 that they own distribution at a scale that is like, I mean, you can't even imagine, right? How do you build that? And how do you go to market with that? Absolutely. So, I mean, the first part to answer your question is, if you pull on the logical thread when you're starting any company, you'll quickly realize it's like a terrible idea, right? I mean, almost any company you'll be like, you know, you're going to work super hard. Your assets are really low. There's all these established companies already, you know, in this space. It's easy to basically convince yourself that like,

Starting point is 00:50:52 this is not a good idea. And, you know, also being fair, a super high percentage of startups fail. So there's a logical progression that just says this is never a good idea. And yet, obviously, we all know some startups succeed and make it. So it's hard to pull in the threads and get to the logical thing. I think for us, we tried to do two things that early on I think were important from how we went to market. And then there's also sort of the product side. I'm going to start with the product side,

Starting point is 00:51:22 and then I'll talk about the go-to-market thing first. The thing I want to mention on the product side, when you look at a term like spreadsheet, one of the challenges you have early on is how do you actually describe your product to people? And when I was describing Signet to be in this show, I purposely didn't say spreadsheet probably until about a minute into it. Because I think if you start with spreadsheet, people's orientation is, it's going to be exactly like this existing product. And then they're going to be disappointed

Starting point is 00:51:49 when they try it. And they're like, but it doesn't, you know, doesn't do exactly the same thing. And even though in their mind, they know that's obvious, they know it's something different. It's very hard. It's, you know, anchor yourself on one of those things immediately that they just expect to clone. From a go-to-market perspective, I think for us, it's interesting when you're doing a product that is in many ways a kind of universal platform, right? The way people use BI databases, spreadsheets, it's not just like, hey, it's only done in the finance department or it's only done in marketing. You know, we're not, there's not like a particular,

Starting point is 00:52:25 by looking at a breakdown even of our own users and customers, it tends to be a fairly even distribution between all of the sort of departments you'd suspect, finance, sales ops, marketing, HR, people ops, you know, product management. And even the use cases people have, you'll see some, you know, concentration on things like, you know, financial ops or supply chain management or embedding and building out sort of custom data apps. So you'll see these concentrations, but you'll also see like a long tail of just like people do all sorts of things with these products. And that makes go to market hard for a platform because, you know, the worst pitch is you can do anything with it.

Starting point is 00:53:02 Like your customer is like, that doesn't match my problems, right? Like I need a real problem. I think for us, it was a few things. One was for larger customers, you know, the enterprise customers we have, you cannot go in with a pitch of like, hey, you know, we're going to make you dashboards. Like they already have dashboards. Like this is not a problem you're trying to solve. So we had to find new problems that we were uniquely able to solve that like we were not competing against an existing product. We were competing against they couldn't solve this problem. And then we had to find motions in the mid market and below where we could just go head to head with existing products. And that was a strong flow for us. Obviously, partnerships were a huge part of it. So we've been, you know,

Starting point is 00:53:46 these data products are interesting because by themselves, they're actually kind of useless, right? Like what would you do with a warehouse that had no data in it? What would you do with an ETL tool that couldn't talk to anything? Or a, you know, BI product

Starting point is 00:53:58 that couldn't talk to anything. So you end up actually having to buy a stack. And that means you have a natural set of partners to go to market with. And so that's also been a huge part of our motion. That's super interesting. Okay. And let's talk a little bit about the future now, right?

Starting point is 00:54:14 BI is not like a new concept, obviously. It's been there for, I don't know, since we started having mainframes and databases and businesses that wanted to use technology to drive growth. What's next? I think if you ask someone about BI, the first thing that comes into mind is reports, visualizations, that kind of stuff. And at the same time, we live like in an era where like everyone's talking about AI right now. There is, okay, ML becomes like much more accessible, more and more people, like they can use ML as part of like working with data from spreadsheets, right? Like something that exists for a very long time.

Starting point is 00:55:03 It's very mature. It's something that everyone likes. It's like a lingua franca, let's say, of data for business users out there to AI and large language models or whatever. It's like the next thing that will come out in the next couple of weeks or whatever. What do you see there?

Starting point is 00:55:21 What's next for both the BI industry and for Sigma more specifically, right? Sure. I can start and then I prefer Stipo to chime in as well. I think, you know, as you mentioned, everyone's talking about AI. And I think what's interesting when you look at this, so, you know, we tend to, as an industry, to go through these different waves of kind of innovation or concentration of kind of what people are talking about. And obviously right now, everyone is talking about generative AI and chat GPT, like this, you know, even my mom is very interested in this. And the, so I think the

Starting point is 00:56:04 question we think a lot about is like, for the industry, how is that, like what's going to change? And are we going to see like a new set of players? Are we going to see, you know, some fundamental transformations? I think this is a tough one because, you know, on one hand, it's changing so quickly that like some level I'm like, I don't even want to touch predicting what's going to happen on the AI landscape because next week there'll be something so different. But I will say what I've seen so far, especially in the last year as there's been a wave of things, I've seen a lot of things I would describe as features that everyone is going to already has. And that's probably the, on one hand, it's been this huge enabling technology, right? Almost any product right now is building an outer has a natural language interface. And so I think if you're a startup and you say, you know, the only thing we, what we

Starting point is 00:56:54 built is natural language interface. I think that's a hard thing to be defensible on because I think every product is going to have that, right? Every day, some product announces, hey, now we have, you know, product foo GPT with our natural language interface, right? And so I think you're going to see every product that has that. You're going to see every product enhanced by AI. And I think when you look at the BI and data space, there's a set of common features that people are generally doing. And I think the existing players honestly will probably check the box on all of those. What I haven't seen yet is what is like the wave of features that like we haven't even

Starting point is 00:57:28 honestly maybe even dreamed up yet, but this technology is going to enable. And I think that's kind of the interesting thing to watch over the next six months to a year. And it'll be an opportunity of whether people react or not. I think a challenge for like new entrants here is that everyone's eye is on AI, which also makes it so that even all the big players are reacting to it. A lot of what you count on as a startup is that the big players are asleep and that you can sneak in and get past it. I think that's the challenge for sort of the new entrants here is that no one's asleep on AI. We're all looking at it.

Starting point is 00:58:02 Yeah, 100%. And they have the benefit of owning distribution which is huge right like i don't think i mean if you haven't tried to start a company you don't really understand like how important distribution is like you can't grab like you might say yeah i understand like all the millions of like users that already use Excel, for example. But unless you try to build something, you can't realize how important and what a luxury this thing is. It is super, super important. As you said, I'm also very interested to see how things

Starting point is 00:58:37 will change in the next six months to a year. Stipo, what do you think about that? I think thinking about this topic pragmatically, I think what the tool at our disposal is really human-grade processing of data without a human being involved. And what's so incredible about that is I think it just lowers the barrier to entry for anyone to be able to perform their analysis. So now you can immediately, you can get the context of the data that you need to work with.

Starting point is 00:59:17 Now, something that just came out of the warehouse, you now have a way to be able to get going with that, understand what's going on there. Despite the space of sort of where of cataloging already existing, you know, the convenience of being able to just ask someone or ask an AI and be able to get that context is just super, super critical. So there's this, there's so much, I mean, there's so much in the analysis

Starting point is 00:59:38 that is done today that requires processing. Like if you have data coming from two different sources and they need to be matched together, there's some amount of human processing that's required there. Is it really, should it be required? Well, not anymore, right? You have these tools that you can now lean on. And so I think, you know, we can always speculate on where's the, like the end state going to be before where the, where these tools are today. I think that's the place, that's the end state going to be. But for where these tools are today, I think that's the area that is going to be

Starting point is 01:00:07 the most impactful in our day-to-day. Yeah, 100%. All right, one last question from me, and that's for Rob, and then I'll give the microphone back to Eric. So, Rob, you mentioned at some point about the excitement you had when you first saw the promise of cloud computing back in 2007-2008. We are so many years after that.

Starting point is 01:00:36 Do you think that cloud has delivered? Are we there? Are we done with cloud actually? Or there's still a lot of potential when it comes to the cloud part of it? For me, the amazing thing was just how fast you can move, right? And so if you think about if you were even just starting a company or starting a project at a company 20 years ago, or even more. When I first started my career, we wrote all the software in our stack. We didn't use it.

Starting point is 01:01:09 There was no kind of equivalent of open source. You had to build everything. And then you had to get all the hardware to deploy, all these things that you sort of do. And so for me, there's something amazing about how quickly you can get started on things, how quickly you can iterate, how quickly you can, you know, spend your time on sort of building out new things for value for your users versus spending time on sort of honestly like the

Starting point is 01:01:34 same old problems. And so, you know, I think we are well on our way. I don't think we're anywhere near done yet. I think in some ways we're still learning a lot. I think it's part of the, honestly, the great part about this industry, right? Is that we're actually, I think in a lot of ways, learning how to together move faster, right? In some ways, open source is a way that we saw sort of learned is like, you know, if you talk to a lot of people and said, what if we just gave away our product for free? It sounds crazy. And yet developers have sort of learned that there's a huge value in doing it for a lot of developer-oriented things because it's a way to promote community. It's a way to build on top of things.

Starting point is 01:02:15 It's a way to have building blocks that kind of act like compound interest and build on top of things. And so I think spreading those ideas further, there's a lot of opportunity there. 100%. Eric, all yours. Okay, I want to ask, this is a little bit going back to the AI question, but maybe I can turn this into a good concluding question. One really interesting thing about Sigma

Starting point is 01:02:43 is that in many ways, it gets people a lot closer to the data than SIPO when you were talking about traditional BI. You have this vertical integration and the consumer's at the very end, and so they're actually not very close to the data. When you think about giving someone a spreadsheet interface, it gets them very close to the data. And anyone who's tried to answer a question knows that in terms of ergonomics, looking at the data and just trying to wrap your mind around looking at a spreadsheet gives you a sense of what questions to ask and orients you in many ways to actually perform analytics, maybe even before you start doing analytics formally. One thing that's interesting about some of the AI, especially the chat interface that sort of has become the predominant paradigm, is that it obfuscates a lot of that, right? So when you think about closeness to the data in terms of

Starting point is 01:03:46 Sigma, and in terms of analytics in general, I just love to conclude on hearing about, like, how do you value that as someone who works with and uses and produces analytics every day as part of my job? That's really valuable to me. And I think it's an important part of it, yet at the same time, it can turn into a lot of, you know, unnecessary work. How do you think about closeness to data?

Starting point is 01:04:12 I guess a few things jump out at me. You know, as you were talking about the spreadsheet interface and I mean, there, there are, there are funny things, right? Like you,

Starting point is 01:04:21 if I just showed someone who was not, you know, it was a very much a novice in analysis, I just showed them some column and it has, you know, some data value that's very different than the others. Like their eyes would naturally go to that. Their questions would naturally go to that. They don't necessarily have to have a concept of, you know,

Starting point is 01:04:36 standard deviations and outliers or anything like that. But it's just, it's sort of human intuition to be curious about certain different things. And I think, you know, early on, sometimes we thought about like, what's the right interface for curiosity? And so at some level, an interesting interface for curiosity is a chat interface because it's a very lightweight interface for asking like, you know, I'm generally curious about like, how are we doing in marketing?

Starting point is 01:05:02 And so it's a very good interface for that. And it's a very good, I think, paradigm for sort of getting, hey, here's like a general high level, you know, here's how are we doing in marketing? And so it's a very good interface for that. And it's a very good, I think, paradigm for sort of getting, hey, here's like a general high level, you know, here's how we're doing in marketing. It's not necessarily a great interface for like a very detailed set of things. And so I think you're always going to see a spectrum

Starting point is 01:05:18 of like, what is the right, for lack of a better thing, tool for the job? But, you know, I love getting people engaged. I think one of the things that I've always been passionate about when I watch spreadsheet users is a spreadsheet user gets in front of the data and it's like they kind of almost want to like hug their Excel and they're like, you know, now that I have it here, I can do it. I can do anything.

Starting point is 01:05:41 And they like just jam out for like the next 30 minutes. And I remember watching like early Sigma users, I'd see the same thing. I'd watch them like, you know, do like 3000 things in a row. That type of thing and building out those types of interfaces, right? I don't necessarily want to type 3000,

Starting point is 01:05:56 you know, questions into chat GPT, but if I could type in like three really interesting ones that start me on a journey, that's a great, that's sort of a great win. Yeah, no, that makes sense. I think when you think about hugging the spreadsheet, I actually think it's a very apt analogy because in some sense, when you encounter a new data set, you need to understand the physical dimensions of it, if that makes sense, right? Like it's almost like, you know, walking into a new house and you kind of need understand, like, you know, how it's arranged in the size and other

Starting point is 01:06:28 things like that. But at the same time, like having an interface that encourages curiosity. Yeah, that's a wonderful answer. Sipa, any additional thoughts before we close it out? Yeah, the thing that I had out here is it's really about how people think, right? There's people who are very verbal, and they think in words. There's people who think in images and diagrams. And I think when it comes to data exploration, it's very tough to do it all verbally. And so in this regard, what I sort of expect is, so first off is you're verbally asking all of these questions, but the second off is, you know, there's, you need to have a level of transparency of what. And just in the same way that if you show up to a meeting, someone gives you aggregate numbers, you want to be able to drill down and actually understand how the heck did they come up with

Starting point is 01:07:31 the numbers, especially if it's something that seems amiss or stands out as an outlier. And so to Rob's point, I would just sort of underscore and agree that, you know, there's, it's a great place to sort of start for a high level and to ask the first question, but that those deeper questions and the way that those spreadsheet huggers work, you know, you really need to be able to see the data and the way that you sort of iterate on the next question to even your point when you were describing this question is that, you know, you sort of see the data and that drives how you interact with and what you're going to ask next and what you're going to do. And you don't need words, you just need the actions. And almost that going from what you see and what you need to do

Starting point is 01:08:14 to translating that to words, to then to data and going back and forth is almost a suboptimal way to go back and forth. But it's really sort of curious to see how technology will kind of progress in this realm. Absolutely. Well, we are very excited to see what you and the team at Sigma build as we figure out how these new technologies are going to influence analytics. So thank you so much for giving us time. Wonderful episode. And we'd love to have you back.

Starting point is 01:08:46 So let us know when you can come back on the show. All right. Thank you both. Thank you. Well, Costas, what a fascinating conversation with Steepo and Rob from Sigma. They have built a modern analytics tool that is a spreadsheet and SQL interface. So relying on the two most common paradigms for modern analytics. And it was fascinating to hear them talk about

Starting point is 01:09:17 how and why they made the decision to pursue that kind of interface. I think two things stuck out to me. One, they didn't start there. That seems like an obvious decision. Well, let's just bridge this gap between the spreadsheets and SQL, which have the most usage in terms of analytical interfaces than any other tooling in the world, but they didn't actually start there. Rob, as a founder, told us that they went through many iterations and did a ton of research, watched tons and tons of analysts use a bunch of different tools, and ultimately came to that conclusion. That, to me, was fascinating. The other piece that I really appreciated was they really had kind of a humble are pretty opinionated because they maybe think that

Starting point is 01:10:28 they need to create a lot of guardrails for the end users because they sort of know how analytics need to be done better than the end user. And Rob was pretty clear that's not actually true and that he seems to have a fundamental belief that that's a pretty bad way to build products in general and you know from their success it's clear that i think they made the right choice and sort of um you know giving different kinds of advanced tooling to different users so i loved it it was a absolutely great episode. Oh, yeah. 100%. I mean, there are a couple of things that I found very fascinating. It was a wealth of, let's say, wisdom there from some people with a ton of experience in the industry.

Starting point is 01:11:28 I really enjoyed the part where we talked about the differences between the relational model and the spreadsheets and how these can be bridged. What does this mean in terms of the product that we are building, in terms of the experience that you try to build there? And there were some very interesting insights there. But also, from a technical perspective, what it takes to engineer a system like that, which is also super, super interesting and helps us understand the difficulty and the complexity of the problem

Starting point is 01:12:06 that we are dealing with. At the same time, it's very interesting to hear that it is a journey. Solving a problem is always a journey. That's what I really keep from this conversation with both Stipo and Rob. We have done so far like an am like i would say and they've done an amazing job they didn't say the word amazing because like they are

Starting point is 01:12:30 very humble but sigma is like an amazing tool in terms of like how like the experience that it delivers at the end but rob was like very explicit on that there's still like a lot of work to be done there and many things that like can be improved and the other thing that i'd say and i'm not going to disclose exactly what because i want to keep that as like a surprise like for our audience is that there's also like a hidden gem in this conversation about go to market and building products and companies, especially when you are going out there to compete against some very established and big companies. So I would recommend to anyone like to pay attention there are like some very interesting parts

Starting point is 01:13:29 of the conversation not only from the technical and the product perspective but also the business perspective I agree all right well thank you for listening another great episode in the books subscribe if you haven't on your favorite podcast platform and tell a friend if you haven't on your favorite podcast platform.

Starting point is 01:13:45 And tell a friend if you haven't. We always love getting new listeners. And of course, give us feedback. You can go to the website, submit the form, tell us what you like, what you don't. And we will catch you on the next one. We hope you enjoyed this episode of the Data Stack Show. Be sure to subscribe on your favorite podcast app to get notified about new episodes every week. We'd also love your feedback. You can email me, ericdodds, at eric at datastackshow.com. That's E-R-I-C at datastackshow.com.

Starting point is 01:14:18 The show is brought to you by Rutterstack, the CDP for developers. Learn how to build a CDP on your data warehouse at rudderstack.com.

The Data Stack Show - 143: Collaborative Data Analytics on the Data Warehouse, featuring Rob Woollen & Stipo Josipovic of Sigma

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.