The Data Stack Show - 167: Data-Driven Investing and Company Building with Ben Miller of Fundrise

Episode Date: December 6, 2023

Highlights from this week’s conversation include:Ben’s background in real estate (3:27)Why Fundrise was Started (4:37)Democratizing Investment Opportunities (6:35)Investment Thesis for Venture (11...:55)Challenges with Data and Technology (12:34)Importance of Data Model Abstraction (20:03)Data Infrastructure and Investments (23:22)Evolution of Data Engineering (25:12)Closing the Tooling Gap (34:23)The user base segmentation (36:28)The emotional reality of investment decisions (40:50)Data inputs for real estate investment (47:07)The work of data infrastructure (48:28)The limitations of underwriting analysis (49:36)Improving accuracy with data infrastructure (52:43)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome to the Data Stack Show. Each week we explore the world of data by talking to the people shaping its future. You'll learn about new data technology and trends and how data teams and processes are run at top companies. The Data Stack Show is brought to you by Rudderstack, the CDP for developers. You can learn more at rudderstack.com. Costas, today we're talking with Ben Miller, who's the CEO and one of the founders of Fundrise, which is a really interesting company. It democratizes investment opportunities for assets that are typically off-limits or very
Starting point is 00:00:42 difficult to get access to for your average person, right? So let's think about, you know, a big corporate real estate development on one of the spectrum, but fascinatingly, venture capital investments in tech on the other end of the spectrum. And you can do both on Fundrise. And there's more. And I'm really excited to talk with Ben, one, because they're an extremely data-driven company. And he's a CEO, so he's very technical, but he doesn't have a day-to-day role in the data space. But they're extremely data-driven. And one thing that I'm personally extremely curious about is with a portfolio of assets that diverse, you're dealing with pretty different types of data, pretty different types of data modeling,
Starting point is 00:01:30 very different data formats, right? If you think about the real estate space, probably still a lot of PDF documents and other things like that, right? So I'm interested to ask about that. I'm also interested in the combination. I mean, how do you run a platform that allows you to invest in an apartment building and data breaks, which is crazy. So that's what I'm interested in chatting with him about. How about you? Yeah. I mean, we've talked with many people all these years about architectures, technologies, like how we use data, like how we process data, you know, like we tend like to focus more on like the, let's say like the technology side of things and what is like really exciting going on there.
Starting point is 00:02:15 And we take for granted like the connection with how like value is like created out of data. But that's what like I find like find so fascinating about the episode that we have with Ben recorded, that we literally have one of the best examples here of how data directly connects and working with data
Starting point is 00:02:35 directly connects with value creation. And of course, the financial sector in general is a very good example of that. But here we have a person who can take us like through more unconventional uses of data as like the business also it's like unconventional in terms of like how like offers access to investment and yeah that's what i'm
Starting point is 00:02:56 really excited about i really would like to hear more about how we start like from the raw data and we end up like in creating creating really tangible value that we can see as dollar signs in our bank accounts, right? So that's what really excites me about this episode. All right. Well, let's dig in and talk with Ben. Let's do it. Ben, welcome to the DataSec Show. We have a lot to cover and we're going to talk about data in multiple different contexts, but give us your background. Okay. Well, so I'm currently CEO and co-founder of Fundrise, but I got here over a
Starting point is 00:03:33 long period of entrepreneuring. So I started in real estate, private equity, venture capital, worked for a tech startup that went from zero to a hundred people back to zero in 36 months. Wow. I've worked in, yeah, real estate development, doing urban mixed use, like huge skyscraper type buildings in the middle of like cities. And that went through the 2008 financial crisis and then Fundrise, you know, we started it in 2012. We have 275 people at the company. We have 2 million users, I think 20,000 apartments that are owned through the platform.
Starting point is 00:04:16 Wow. And yeah, so basically, I feel like I've aged. I feel like I've lived 50 years. Yeah, it sounds like it. So tell us, why did you start Fundrise? I mean, you obviously had a background in real estate, but Fundrise is not just about real estate. So tell us about Fundrise and why you started it. Well, so anybody who went through the 2008 financial crisis came away very skeptical of mainstream finance like it's kind of lost of any confidence in the financial system yep and so i wanted to kind of build essentially an alternative to it that you know my dream was to basically build an alternative to mainstream finance and that mean meant like creating a way for people to invest in things that are
Starting point is 00:05:05 like real estate or like private tech or credit, things that basically super high net worth or institutional investors do. And they're sort of less correlated or less connected to the stock market and the bond market, which are what normally people invest in. Yep. Yep. Okay. So I just have a question around the mechanics of this. So when you think about private investors or high net worth investors, and you can correct me if I'm wrong here, but generally there's sort of a, and maybe I'm thinking about this more from like a deal flow standpoint, right? Like people fight over deals and you're trying to make a good investment. You have money to throw it at.
Starting point is 00:05:55 And so there's, you know, it seems like that sort of a closed door network where deal flow happens a lot through word of mouth or other methodologies, right? Where the public stock exchange is sort of on the other end of the spectrum, where you can go on E-Trade and you can buy whatever you want. How do you bridge that gap? Because it would seem to me, and just to be a little bit more pointed, and if this is too spicy, let me know, but people who are getting good deal flow tend to try to protect their system, right? And so you democratizing that, how does that work? Yeah, that's a great question. Well, so partly I came from that system so I can go into that system and get a good deal. So that used to be my job. And so that's part of it.
Starting point is 00:06:50 And then there's a story that those people tell about how they have this deal flow and they're proprietary about it. But that's mostly story. The reality is if you come in with a checkbook, it turns out that you can- It turns out that money talks. Yeah. Surprise. Yeah. And we've done that now across multiple asset classes.
Starting point is 00:07:16 We bought billions of dollars in real estate. We've walked in the room and done hundreds of millions of dollars in very sophisticated credit instruments, aspect securities. And we were buying and buying in a, this is 144A transactions, we had to have a hundred million dollars in net worth. And it's like, know talking about rarefied yeah yeah i mean that's yeah most people most people are rare you know yeah bloomberg terminals and stuff so and we've done that and we've also done venture where which people told me it was the most exclusive and yet we you know we own we've invested in dbt labs and databricks and Canva and ServiceTitan and Vanta. And so we, so, so yeah, I mean, it's, it takes skills and it's not like, but it's not as rarefied ultimately as it, as they want you
Starting point is 00:08:14 to believe it is. Man, I love it. That's sort of a great, sort of a great story of like democratization. We're also just sort of the spirit of like if you show up with a checkbook i love that analogy actually it's not an analogy the reality of that because that's sort of what you're you know enabling people to do by proxy we have a lot of things to talk about from a data standpoint but of course our listeners know i can't not pick apart the business model a little bit so So, you know, and look, I'm not, this is not a show about investment expertise and I'm not an expert investor, but I know enough about asset classes to know that real estate and venture are sort of at like really
Starting point is 00:08:57 extreme expense, you know, pretty extreme ends of the spectrum. Right. And so if you're investing in commercial real estate, you're generally looking at sort of decades long, almost an annuity type approach when you're thinking about the asset, like far more durable, much longer cycles. And then on the venture side, you said that you work for a company that went from zero to 100 employees to zero employees in 36 months, right? Now, of course, if you're looking at the Databricks of the world, of course, they've reached a level of stability. But can you reconcile that for me and the listeners just to try to understand? Generally, investors make money by specializing in a particular asset class. And you seem to have achieved success by democratizing access
Starting point is 00:09:47 to multiple different types of asset classes. Yeah. God, that's such a fun question. Well, so there's a great saying that I don't know if you've heard, but basically says, in venture, you make 100 investments, you lose all your money on 99 and you make them all back on one. Sure. That's how the unit, that's the economics of that. Power law. Yeah.
Starting point is 00:10:13 And in real estate, you make a hundred investments, you make money on 99 and you lose it all on one. I hadn't heard the back half of that, but probably because i haven't been involved in a ton of commercial real estate because there's so much leverage there's so much leverage in real estate there's so much debt so so yeah there's a lot of bad them that are really opposites and you know if i hadn't worked for a tech company and they went you know up and down and if i hadn't been spent basically another 10 12 years building company funderize like i you know i don't know if i'd have the background in tech but after probably
Starting point is 00:10:53 15 years of tech and venture i have a pretty good sense of like of at least mid to stay mid to late stage venture yep or you might call it growth equity. Sure. And one of the ways we generally have been investing, at least on the venture side, is that we use the tools. We have 100 software engineers at Fundrise. Wow. 100 software engineers.
Starting point is 00:11:22 Yeah. Yes. It's actually funny a lot of companies we got to them because we were wow customer and we're like this is such a good technology we got like dbt right oh my god we have to listen to this company we have to listen to this company this company is like a one-of- It's really, I mean, I think it's going to end up being the central nexus of the revolution that's happening in data. And we just like hunting them down. Yeah. Oh man, I love that. I love that. Okay. So that is... Okay. So would you say, just to dig in on that a little bit more, do you have an investment thesis for venture as part of Fundrise and let's just zero on technology like a DBT? Or is your investment thesis like on the ground value that you can...
Starting point is 00:12:23 You essentially have a lab as an investor. I'm just trying to put my investor hat on. Like you have a lab of a hundred software engineers who are managing and we'll get to the data part. I promise to our listeners, we'll get to the data part, but I can't imagine that. I mean, first party data from apps, third party data, I'm sure it's insane. I'm sure you have like actually pretty severe, like challenges. And so you essentially have a lab where you can like try a bunch of different technologies. And
Starting point is 00:12:52 so you almost don't necessarily have to have a thesis as much as your team can say, like, if we had to pick a winner, this would be it because we're doing it on the ground? Yeah. It's funny. I have this view and I repeat it constantly internally, which is when trying to analyze something, you want to analyze it from the bottom up and then from the top down. And we need to do both. And you actually need to iterate on both because often when you do one and the other, you find they don't actually match. And you're like, well, the bottom up and the top down shouldn't give you different answers. And as data people, you start to, you know what I mean.
Starting point is 00:13:30 And so we have top down because basically one of the sort of, I think mythologies in investment is that there's all this alpha, but I actually think it's mostly beta, which basically means that the macro, you know, if you're investing in cloud over the last decade you know you probably did pretty well investing yeah in like you know crypto is enabling technology basically didn't work like so so like as good as a company picker as you might have been like this the macro is such a big driver of returns like you just bought
Starting point is 00:14:02 fang for 10 years you're oh, you look so smart. Right? Yeah, yeah. So, and so that's like, I think it's very true in real estate. Like if you just had
Starting point is 00:14:11 bought apartments in, you know, the Sunbelt, like everybody moved there and everybody lived there and inflation drove up tons of rents. So,
Starting point is 00:14:21 so that's like half of my belief. The other half is a bottom up and the bottom up is like with the software, we're using it. We're like, right now we're in the process of picking a, I don't know what you'd call it, but basically a reverse ETL. So we're looking at all the companies and I'm like, wow, sorry not to throw any shade
Starting point is 00:14:43 here, but like, wow, Segment really is in trouble. Everybody talks about Segment being so great. It seems like, oh man, Segment's likely like, doesn't have a future. And so I just believe you have to do both. And that's how we do it. And so I have a strong, I mean, our thesis has been modern data infrastructure.
Starting point is 00:15:04 We've done a little bit of Prop tech to investments because not much, because we know a lot about property and technology, but our top down thesis has just been the modern data infrastructure. And now an AI combined with modern data infrastructure is just, it's a, it's a wave, like as big as the computer was and so then we're not venture and like having to pick three people and a dog right we're like we can that's really hard but if you pick like you know our investments like like five fan we want to invest in five fan that doesn't take there's not like that much imagination in that yeah yeah and Yep. And so, and I, and if I were, you know, I'm going to pitch you, I'm going to say, hey, you're in the data industry.
Starting point is 00:15:49 Like, shouldn't you be able to invest in Fivetran and Retool and dbt and Databricks? Like, yeah. Yep. Yep. Super interesting. It's refreshing to hear. And again, this is sort of like total armchair investor, but the logical conclusion with those two things would be that you would invest in property
Starting point is 00:16:13 technology, right? Management, all that sort of stuff. But it's actually great to hear that you're just sort of following the things that you are finding actual value in and not trying to like wedge something in with these two, like sort of core competencies, which is great. One question on segment, because segment in many ways is like a really great product, but you're evaluating reverse ETL. Like why do you, why do you have that perspective? And I'll direct the question a little bit in terms of, do you think, because they're a pretty large business, actually.
Starting point is 00:16:48 Do you think the innovation has slowed down just because they got acquired and it's much more difficult to innovate from a product standpoint? Oh, okay. Well, so our head of chief product officer and one of our senior engineers has been looking at it. And so I could tell our
Starting point is 00:17:06 story our data story is we basically just probably live the story many companies did but we what we don't what we want is very different like so we looked we're looking at high touch census and rudder stack so it's and and so i'm not sure i can do it justice to basically explain to people who are really technical but the difference of basically like we want to have control of our own data we don't want to shove our data somewhere else yeah right and that's probably the main problem yeah i think what that's and and that's why we you know we're looking at the companies that allow that yep yep sort of more of like a warehouse native like the sort of you have control on your you're sort of building an infrastructure around your snowflake or databricks environment right
Starting point is 00:17:58 exactly and be able to like create audiences and shove more data you know like you use iterable we can shove an iterable we use zendesk for customer service we have tableau and looker and we just we basically need to be able to take the data and we want control the way we want and shove it to where we want and not be constrained by like you know sure some antiquated approach yeah no i mean i think i was actually a guest on a completely different podcast recently and we were sort of talking about how abstraction is the way of the future right because the big challenge is that if you get into a situation where you're beholden to, let's put it this way, anytime that you're storing data in a third-party cloud system, you're beholden to what I believe,
Starting point is 00:18:55 and actually I'm interested in your opinion on this. There's sort of two major problems that I see with that. One is that they have to choose a data model or a schema, right? You can't store the data without having an opinion on that. And in most environments, you don't have control over that unless it's your own data warehouse or data lake, in which case you can define your own data schema that matches your business logic to a T, right? But if it's in another system, they have to decide how to store their data. And it rarely matches. That's a huge pain point. The other big pain point, which is maybe not equally as painful from a fundamental standpoint, but from a practical standpoint, is that you're subject to a gatekeeper right as opposed to having access to the raw data those are two like major things that i'm with you and that like if you're not building
Starting point is 00:19:52 towards that or thinking about that it's going to be hard because companies just are becoming they have way less of an appetite for dealing with that It's just not acceptable anymore. Yeah. And like, so now we're getting into the data, which is like, I've become obsessed with. So the data model, that abstraction is so essential to being able to have the right downstream business conclusions
Starting point is 00:20:21 and business like abilities. And so like, actually i'll start with like even though we built the our web apps and our mobile apps first we had like scale on that side you know before we got to scale on the real estate side it's the real estate side where we really got i got more sophisticated about data. We went about creating a data model that we thought reflected all of how real estate works. So it wasn't constrained to... And this is a true thing for companies too. So you have operational data, you have accounting data, you have market data like so you have operational data you have accounting data you have data
Starting point is 00:21:07 and you have a kind of sometimes you have various financial data and all those things sit in different places yep and and are like pretty different in and of themselves right like oh yeah they're different i mean there's the sort of pipes challenge of getting it and joining it and cleaning and all that stuff but there's also just like then you have to have a, but it goes back to the data model. You have to have a sense of how they all relate to each other in a way that's as abstracted as possible so that you basically don't, I mean, the mistakes we've made over the past 10 years is we baked into the wrong place opinions. Can you just give an example of like,
Starting point is 00:21:47 what is an opinion that you baked in? And specifically, where did you bake it in? And why was that problematic? Like at what point in the pipeline did you bake it in? I'll give you a couple. I'll give you one that's sort of more data and one that's more business. So when we first started, we have like, you know, thousands of apartments and every apartment
Starting point is 00:22:08 has millions, basically millions of transactions. Like, you know, you paid 15 cents for water that day. And so we have all of that data. And the utility. Right. Because everything. Wow. Yeah.
Starting point is 00:22:20 Okay. So you know, you had to replace the unit level. So you have like almost a building level and then a unit level. So we're talking about like, and then you have a tenant level, and then... Yeah, it's so funny. Holy cow, yeah. Yeah, this is like what I was saying to you earlier, like basically everything in the world is data. Yeah.
Starting point is 00:22:49 Like, right? The whole universe is data and just and so if you're in an industry and you care about outcomes basically you want to figure out how you can model that subsect of the universe so that you can start to get good at at predicting what's going to happen and understand what went wrong. And so in a property, right, when you have, there's, you know, they're like a city, you know, a thousand people might live there, all sorts of things happening. And you're trying to basically get good outcomes at the, and that's a combination of people feeling good and people paying the rent and people having everything fixed. And then next door, they may be building a whole foods
Starting point is 00:23:30 or they shut down the street because there's traffic, because they're going to modify the road. All these things happening in the built environment. And that built environment also has things happening virtually. You find an apartment by looking on the internet.
Starting point is 00:23:45 That leasing traffic comes through the internet. There's just all these flows of data around a nexus, which ultimately is a property. And if you can get really good at capturing all that data and modeling that data, you can actually just make better investments and you can build infrastructure that ultimately I think everybody wants to use. And when you say infrastructure, you're talking about physical...
Starting point is 00:24:10 Well, I meant data infrastructure. Okay, data infrastructure. But it's both. I mean, you can get to both. But ultimately, physical infrastructure comes from a data decision. An insight that you get from information. Yeah.
Starting point is 00:24:28 And when you say data infrastructure that people like to use, there are multiple potential audiences that you could mean there. So like, who are you thinking about? Were you thinking about like people who are data professionals? Are you thinking about the tools that the actual tenant uses to streamline their, you know, sort of engagement with you as a landlord? Right. Yeah. So, okay. When I listen to your show, right, you usually have people who are technically building tools.
Starting point is 00:25:05 And what's happening in the data infrastructure industry is that there's like this tooling or technology being built that really, I mean, like they're, to me, they're like revolutionary. I mean, they're just from where it was. Yeah. You talked about DBT, like as a shining example. Sure. But I mean, if you go back, I mean, let me just digress for a bit and I'll come back to what you're asking because it's almost like you have to answer it from the top down. And so 10 years ago, when we built Funrise, we have 100, we currently have 100 engineers,
Starting point is 00:25:33 software engineers. One is a data engineer. No way. Are you serious? Because you have what you said, what, 200 to 300 employees, right? A couple hundred employees, 100 software engineers, and one data engineer. That's shocking. Yeah, because the organization is like data engineering or data.
Starting point is 00:25:55 It's like a separate segment. And mostly we have application engineering, and we basically process billions of dollars and we push information to our web application and our iOS. A whole team that just maintains performance of your real-time performance of your portfolio
Starting point is 00:26:18 and we do a billion data points, processing and stuff like that. Those are data products like yeah you're delivering a data product inside of an app essentially yeah but they're not you know the idea of like a separate you know we have a cloud ops team right yeah an identity team but we didn't but there we didn't have data engineering as its own resourced department. Yeah, interesting.
Starting point is 00:26:50 Just one quick perspective on that, and tell me if I'm off here. It's interesting because you're delivering a bunch of data products, which is almost like a software subsidiary in a way of your apps, right? You sort of have like an end user and you're delivering different features and products within the experience that they have, which is interesting, right? Like centralized data engineering isn't necessary to deliver a data product to an end user, right? Data is just an input that sort of feeds like a user experience. And that feeds a user experience. That's fascinating, actually.
Starting point is 00:27:28 That's the layering there. Yeah. Well, I could be all wet on this one, but I just think the evolution we're going through is happening writ large at other companies. Just like if you go back to 10 years ago, we had a monolithic code base, and we ended up with microservices. And so that's sort of what's happening in the data industries. The first phase is you build out pipes with Fivetran, get DBT in place. And then now we're going to do the reverse CL. So you're kind of working your way through yeah like the natural evolution and so like but what where i sit is that the application layer or the business layer where we're going to use the tools
Starting point is 00:28:12 that are being invented by companies like databricks and dbt to build applications for people who are not data professionals yep right. Now, is that the end consumer? Or when you say not data professionals, define that, because that could be someone who is on the Fundrise app and they want to make an investment and I don't care,
Starting point is 00:28:36 or that person needs the data to make an informed decision, or you're sort of presenting them with the ability to make an informed decision? Right. Well, so there's lots of different ways to do that. We have two. We're definitely one is essentially it's just an internal user who can like make better
Starting point is 00:28:53 marketing campaigns and decisions about products and just have data. And we are the way we did it before we had our database and we just dumped it directly into Tableau in Tableau's memory. Like, we're just like, here you go. And then we had like, by splitting that up and using the logic out of Tableau, like we can like, so there's like things that we didn't, that we're like kind of learning. But then, but I'm like the, I go to real estate and I, when I think about real estate, I think about finance. Real estate, commercial real estate, apartment buildings, and industrial buildings, that is a form of finance mostly. Yeah. And so I gathered the stat before this podcast.
Starting point is 00:29:35 So there are 100,000 data analysts in the country. In the United States. In the United States. That's their job title, data analyst, no matter who they work for. I think it said I think the sentence is 93,000 so okay and there are 6.5 million
Starting point is 00:29:51 people in the financial services industry working what so and I think that those two professions converge yep I think that those two professions converge. Yep.
Starting point is 00:30:07 And in the same, like basically in the same kind of pattern, which is that for us, I think for most people to get the data, we had to originally go to engineers. Like, can you run this data custom report? And then we could have got to a place where now like a data analyst can write SQL or, you know, and get the data for us. Yep. And then I think we'll,
Starting point is 00:30:30 we want to get to a place where then like a person like me or a person in the marketing department can get the data. Yeah. Without knowing SQL. And then, you know, what's going to happen is that like, so then that's that person who's like, me is like an Excel user, like a self-spreadsheet, basically how they think about data. And then what's going to happen is that it's going to be done by AI with natural language. Yeah, I agree with that.
Starting point is 00:30:58 So that progression is happening. And that progression is the progression of all technologies, which is that it becomes mass user. So it gets really cheap, and it gets really easy. And so the data infrastructure industry is sort of in the mid-phase, is now graduating from it being deeply technical to being sort of a business person starting to get at it. Yep. And that's what I want to do for real estate and finance. That's where we're going for finance.
Starting point is 00:31:34 So a financial professional should basically know in five years, they shouldn't be using Excel, which they rarely. It's so clunky. Yeah. Okay. So a couple of thoughts there. This is a conviction that I've held for quite some time. So I think the statistic that you mentioned is, I think it's true, but I think it's actually sort of a gross mislabeling problem. And let me tell you what I mean by that. My guess would be just based on hearing your background, that you have built things in Excel that on paper are essentially software, right? I mean, you probably like essentially built
Starting point is 00:32:27 software in Excel, right? Whether that's, you know, the ability to sort of model something or analyze data you're using, you know, sort of like macros and, you know, and some of the best software developers that I've worked with actually came out of an analyst background, quant background, and are just Excel masters, right? And that gives them a fundamental understanding of the relationship of data and how to express things through logic, which is really interesting. The gap has been a tooling problem, which I think you highlighted really well, right? Like you have an analyst who's really good at SQL and they can wrangle Tableau, right? And so they're like labeled an analyst. And then you have someone who is trying to build multiple scenarios across multiple
Starting point is 00:33:21 connected Excel files to try to predict, you know, to predict margin on a large commercial real estate investment that has thousands of inputs into the model. It's like, well, where's the skill set difference? There's certainly flavor differences, but you're just talking about different tools. You're talking about people who are sort of producing similar types of work. They're just using very different tools. And I agree that it's so exciting that's starting to converge where it's like, well, those should not be like separate
Starting point is 00:33:55 and really Excel like, you know, should sort of fade into the background. So I agree like That statistic is fascinating, but I also think that's a tooling gap and it's starting to close. I think it's probably maybe earlier than you. You sort of said we're in the middle phase. I think we're still in pre-middle phase,
Starting point is 00:34:19 pretty early, but maybe that's just my perspective. Well, I guess my goal is to try to move us to the next phase, to build a product or products that start to leverage the more technical products like DBT or Databricks, both of which we invested in. Oh, wow. Okay. So, Fundrise, you gave people the ability to invest in both of which we invested in, in ART. Oh, wow. Okay, so Fundrise, you gave people the ability
Starting point is 00:34:46 to invest in both of those companies. Yes. Both Databricks, dbt, we've invested in ART through one of the ART technology strategy. And both of those companies are enabling technologies that if you can basically get the right data into their tooling
Starting point is 00:35:17 and then get that into the right UI so that's basically, I mean there's a lot more than that happening, right? But that's where you can start to basically democratize access to the innovations. Yeah. Okay. I have a funny question about, this is a quick sidebar, but then I want to ask about third party data because the real estate aspect of that is really fascinating to me. I'm just thinking about a Fundrise user, and I'm thinking about investing in an apartment building, and I'm thinking about investing in
Starting point is 00:35:55 Databricks, right? And I'm thinking about the number of people I know who have an appetite to invest in both of those things, because traditionally, when you think about the way that people want to allocate capital, like an individual who wants to allocate capital, you're going to play to your strengths, right? And so you have people who are like, they invest primarily in real estate, right? And then the other end is venture or whatever you want to call it, right? Angel investment. You have a large appetite for risk and you have, you know, your sort of portfolio like
Starting point is 00:36:33 affords for you to sort of make a wide investment in a number of things that are like asymmetrical. And so you have a larger volume of investments, but, you know, and you're willing to lose because one of them might be asymmetrical in terms of returns. It seems like Fundrise is a place where you're presenting an opportunity for someone to sort of do both things, but is that the actual user? How is your user base segmented? I'm just thinking about myself, actually. It's like, well, I actually would kind of be interested in making both of those investments for different reasons as part of my portfolio. But maybe your average investor isn't thinking about investing in commercial real estate and Databricks, right? And your traditional commercial real estate investor, not to stereotype, but may not know why Databricks' technology is you know creating so much value for their company. Yeah
Starting point is 00:37:27 that's why we need high touch for one of these companies so we can create better audiences. Yeah. We got to create the audience in Databricks you know whatever anyways. Yeah. And because we want to build more personalized
Starting point is 00:37:43 experiences. Yeah. But in terms of just your gut sense of your users, it's such an interesting dynamic of extremes of the spectrum on this single platform. Yeah. I mean, we have a lot of investors. And so what happens is there's a lot of use cases. I don't like personas because there's just too many.
Starting point is 00:38:09 When you have a million and a half million investors and people invest, this is one of the things. They invest. What they're doing in 2021, the middle of the pandemic, when print money is being printed, and what they're doing right now is so different. Oh, yeah. Even the same persona is a different persona. Yes. Or they're just,
Starting point is 00:38:39 so their behavior is much less consistent than they believe or any kind of like, you know, archetype modeling would believe. Interesting. And I think that like, if you put the, I mean, right now, archetype modeling would believe. And I think that like if you put the, I mean right now everybody's talking about AI and a year ago no one was talking about AI. Yeah. And so like, you know, we've invested
Starting point is 00:38:55 in some pretty good AI companies too, which we haven't announced yet. And if I were to start talking about like like, you know, RAG start talking about RAG and why this company is so central and OpenAI uses them and people a year ago would have just been totally not interested. And not persona driven, right?
Starting point is 00:39:19 I mean, this goes a bit of the top down, like the macro, high interest rates, oh, I'm making 5% of my savings. Maybe I'm not going to invest in anything. There's so many things affecting that investor that it's not as driven by the persona or psychographic demographic stuff. The problem with the analyst is they really want to believe that it's like analyzable. Yeah.
Starting point is 00:39:47 Yeah. And they seek signal desperately, which is like Peter P hacking. That's just their, that's their flaw. Yeah. And, and so like,
Starting point is 00:39:56 that's like usually my experience with seeing data is that it's either dead obvious in the data or there's no signal and you have to look for it. You're basically p-hacking. Yeah. Unless you have a pretty stable data set. Yeah. But what in this world these days is stable? That's a great... Okay. This is maybe a little bit more of a personal question for you, but I can't resist. And if you don't want to answer, that's totally fine. But you have millions of investors on the platform. A lot of people believe that investment is an inherently emotional decision and we convince ourselves that it's a quantitative decision. What's your take on the individual investor? And the reason I ask that is because I'm trying, and really that's a mirror question
Starting point is 00:40:48 for me, because I think about investing when there were interest rates were zero, pandemic, and am I a different person now? I am a different person. A lot of people would say, well, you shouldn't have an emotional response to this, right? Like you have a plan, you stick to it. But I made decisions based on, you know, my perception of things. And a lot of that is emotional. Yeah. I mean, my conclusion is that there's the emotional reality first, and the facts are gathered to fit the emotional reality and that's how people are and doesn't and it doesn't matter if you're an
Starting point is 00:41:31 individual or you're a institutional investor like the facts are second and the emotions are first yep and that's like you know why the passive investing movement has been, I think, really constructive. And a little bit of our investment philosophy is like, we're not trying to be smarter than Sequoia or Blackstone. We're trying to basically just create access and index what hopefully are obvious macro drivers of good companies. But it is frustrating because in 2021, people were shoving money at us. And I'm like, no, just got to just sit and hang out. So hold on, hold your horses.
Starting point is 00:42:20 And now it's like people are much more reticent, much more concerned. Everything's going down. Everything's going to keep going down, I think, generally, broadly at least. And that's a great time to be investing. Prices are much lower than they were and everybody's much more reluctant. And so it's hard. I mean, investing, stock markets are a mass psychosis business. I mean, it's just massive. People look at the markets and think that reflects quantitative analysis,
Starting point is 00:42:51 but it mostly reflects a psychology sentiment. Yeah, yeah. And you can't really get outside your zeitgeist. And so it's one of the hardest parts of my business. Now, I would think though, and I don't want to read too much into it, but one thing that's interesting to me about having a platform like yours is that if you think about an individual making investments, it's not like they're getting data back from the systems that they use.
Starting point is 00:43:25 Again, like let's just think about someone buying a stock on E-Trade or even someone who makes an angel investment in a technology company, right? They're like, as humans, we will automatically bias ourselves to confirm that the decisions that we made were good, especially when it's our personal money, right?
Starting point is 00:43:50 It's very difficult to break out of that. What's really interesting to me about the dynamic that you provide is that you can provide data back that doesn't care about how you feel, right? And so you, in some ways, can potentially stem a little bit of the confirmation bias that we all struggle with by providing sort of an objective perspective with data is that true i think that the way we think about it is slightly different but that we try to provide more content, like more, like, like if you are,
Starting point is 00:44:27 if you were an investor and you're looking at our app, it would, it's a newsfeed. It looks, and you're getting like investment strategy memos about the investments. Like, Oh, we invested in this company.
Starting point is 00:44:39 Why is this company exciting? From Fundrise. Like you. from us. From us. And I, there's a reason why people don't invest in things is they often don't have confidence. And they build confidence by building knowledge.
Starting point is 00:44:54 But the world, I mean, the world today, the last four years, it's just been an absolute roller coaster. Yeah. And, you know, clearly if you look back, like it'll be fine you make your way through it, it goes up, it goes down mostly if you do
Starting point is 00:45:10 the right stuff it does pretty well but that volatility people hate that and that's just a challenge and it goes back to the volatility is just an emotional the emotional reaction to the volatility things are going to go down.
Starting point is 00:45:28 I think maybe it's if you go through enough cycles you get less affected by that up and down. Yeah, I think my big takeaway there is data itself is not education.
Starting point is 00:45:44 I think that's really wise. Okay, I want to get to some specific data questions here. This has been such a fun conversation and I have a million more questions, but can we talk about the data inputs for real estate investment? Because, you know, deal with as a company, what you expose to your users, and I'm sure I'm going to be way wide of the mark here, so keep me honest, but let's think about simple, reasonably stable data, which would be maximum number of tenants, capacity, average rent, average ongoing maintenance costs, things that you have a...
Starting point is 00:46:50 You probably have a lot of historical data that serves as a proxy where you can probably get that within a pretty accurate margin in terms of predicting what the cost basis is going to be from that standpoint. So you have that side of it. But the value of real estate is on some level highly subjective, is influenced significantly by market conditions, changes in the environment, you mentioned construction, the ability to incorporate new technology into some sort of large building, which is a capital investment. There are all these variables that feel much more subjective as inputs into an investment model. What does that look like for you? What does that look like for your end user? And then of course you have, I would assume, an immense amount of third-party data that you're sort of putting in this model is just publicly available, right? I mean, prices,
Starting point is 00:47:55 tax information, all those sorts of things. Yeah. I mean, this is the opportunity and we're in the process of attacking it. But the way it's done today is by hand, manual. Really? You go gather data. Mostly you buy the data. There's a handful of companies that are the primary suppliers. You go get that data and then you manually input it into a spreadsheet. That is brutal.
Starting point is 00:48:28 And there's like, I mean, we have a hundred real estate people at the company and that's just like they spend their days taking stuff out of PDFs and out of spreadsheets and putting them into other spreadsheets and trying to figure
Starting point is 00:48:43 out what... It's a layer and a bunch of assumptions as you said. and putting them into other spreadsheets and i'm like trying to figure out kind of what it's a layer in a bunch of assumptions as you said and then figure out what you should do and i this is like kind of the you said this earlier this is like the big insight mostly the work is getting the information and then getting that information sort of organized and cleaned and it's not actually once you have you have it joined and cleaned and aggregated and everything else, usually the conclusion is pretty straightforward. But the work, the hours are actually going to that front end part. What if that was done with data infrastructure rather than by a bunch of people doing it by hand?
Starting point is 00:49:21 But the entire, whether it's real estate or venture or private equity or whatever, go down the list of all the financial industry, all lending, the lending, if you're, you know, I work with all these banks, like banks all do it by hand. And so that's crazy, but that's how, that's basically where the industry is. And then the part that like, like the part that I wanted to do this because we've done thousands of underwrites, thousands, and I'm like, if you took all those underwrites and back-tested them, you would see that they're no more predictive
Starting point is 00:49:58 than random. Yeah, no more predictive than a venture capitalist call on who's going to win. Yeah. Because the reality is nobody predicted the pandemic in 2019. No one predicted inflation in 2020. No one predicted the Ukraine invasion in 2021. No one predicted... And so no one predicted that climate change would basically wreck Florida. I mean, kind of predicted Florida, but you wouldn't actually think that insurance costs. But it happened in Texas with inflation. Our rents went up 20% in a month.
Starting point is 00:50:42 Typically, that would happen over a decade yeah and so that was unpredictable and so there's just all these things that have been buffeting the models and i think and so i just and so the industry is obsessed with this analysis and it's because i believe that it's really a narrative it's a sales tool yeah it's actually not really data driven it's really a narrative. It's a sales tool. Yeah. It's actually not really data driven. It's not really data driven. It's like,
Starting point is 00:51:09 what do I need to, what do I have to put on this piece of paper that gets us to yes. And that's like how the investment business works. Yep. And I believe you can get to a place it's going to happen because once you have good data infrastructure, right, you can then start attacking it with ML and AI and you can start to get to a place where it's going to happen because once you have good data infrastructure right you can then start attacking it with yeah ml and ai and you can start to get to a place where it's where you know you
Starting point is 00:51:30 it's pretty not objective because ultimately yeah who knows what the next shock is going to be but it's not like you know you can pretend that this person's a genius because there's so much chance in it yeah yeah. Yeah. I think about that as like, there's a mental model I love called like sort of the, like beware of the fat tails, you know, like the, you could be tracking things so closely. Right. And then like a depression hits. Right. And it's really unexpected. Right. Or to, to your point, I mean, there's sort of all these things that, you know, the pandemic, right. That's like a very fat tail thing where it's like, okay, well, in the bell curve, like most things work, but there's an outlier in the fat tail that hasn't really happened or manifested in the way that it
Starting point is 00:52:15 occurs in history. And so it's a novel experience. And that's just part of the fat tail, right? Where it's like, the tail is not actually close to the X axis. It's like pretty far. And so there's a lot of things that could happen in there that are impossible to predict. And so do you sort of view data infrastructure as like getting the tail a little bit closer to the X axis so that we can have more accuracy or like, how do you, right there, does that show up? Right.
Starting point is 00:52:43 I mean, so basically part of the reason that we're so bad or how do you... Right, that's exactly right. So basically, part of the reason that we're so bad at tail risk is psychological because we have a bias to extrapolate the present into the future. And so we have a sort of expectation that the future will be like a bell curve. So that's partly a human dynamic.
Starting point is 00:53:05 And then it's partly a tooling problem. Excel is horrendous at modeling the kind of multivariable kind of outcomes. It's a very linear structure. And so if you get to a place where on the output, you have better tooling and on the input, you have not just people, but AI essentially that's basically helping. You're like, hey, this is where the data is saying, you're saying this, but let's just, a fat tail event happens every four years. How much of a bell curve is this? So I think that this is what's going to happen in the financial industry. It's going to happen. I'm hoping to be part of it. And that's basically,
Starting point is 00:53:53 again, the application of the tooling that you normally cover is being able to get this more sophisticated approach to data into the hands of the business analyst and the business person who today basically is making bad, poor decisions as a result. Yep. Yep. All right. Well, we're at the buzzer, as we like to say. But I'll end with, again, sort of another little bit of a personal question. Okay, so if you had to completely switch careers and you couldn't be involved in real estate and you couldn't be involved in an invest,
Starting point is 00:54:33 you know, sort of the investment world, what would you do? I really would be a teacher. What would you teach? Oh man, history, maybe, you know, like, know like it doesn't almost doesn't matter science i'd love to teach astrophysics or something like like physics would be fun but i i feel like i just i get a lot of joy in figuring things out and then teaching it and so like i guess like my fallback is to go be a teacher somewhere yeah i hope you have like a some gigantic exit and then you know you go teach kids in high school who are like who
Starting point is 00:55:16 are you and you're like well you know i built a huge company and sold it but i'm gonna teach i'm gonna teach you about like how our government works that'd be so fun that would be i i don't teach i just don't want to teach mbas yeah yeah i'll teach anybody but it's not you know like because and bring like uh real i i want to teach entrepreneur because i don't think that's like teachable but i think yeah i agree with that yeah but i think that there's like stories and lessons and stuff from it that I would want to teach to people who are not MBAs. Yeah. Yeah.
Starting point is 00:55:52 And I would argue this is a whole other episode, but I would argue that like physics and history are probably like much more closely related subjects than a lot of people would like take on face value, right? Like, wow. Okay. Wait a second. You have to tell me at some point what you mean.
Starting point is 00:56:08 Yeah, yeah. We'll have you back on and we can talk about it. All right. Well, it's been such a good show. Learned so much. Congrats on the success of the company. Good luck wrangling the data, pulling out of PDFs. And we'll have you back on sometime soon.
Starting point is 00:56:21 Beautiful. Thanks for having me. We hope you enjoyed this episode of the Data Stack Show. Be sure to subscribe on your favorite podcast app to get notified about new episodes every week. We'd also love your feedback. You can email me, ericdodds, at eric at datastackshow.com. That's E-R-I-C at datastackshow.com.
Starting point is 00:56:43 The show is brought to you by Rudderstack, the CDP for developers. Learn how to build a CDP on your data warehouse at rutterstack.com.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.