Lex Fridman Podcast - Gustav Soderstrom: Spotify
Episode Date: July 29, 2019Gustav Soderstrom is the Chief Research & Development Officer at Spotify, leading Product, Design, Data, Technology & Engineering teams. This conversation is part of the Artificial Intelligence podcas...t. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on iTunes or support it on Patreon.
Transcript
Discussion (0)
The following is a conversation with Gustav Sotestrum.
He's the chief research and development officer
Spotify leading their product design, data, technology,
and engineering teams.
As I've said before, in my research and in life in general,
I love music, listening to it and creating it.
And using technology, especially personalization
through machine learning to enrich the music discovery
and listening experience.
That is what Spotify has been doing for years, continually innovating, defining how we experience
music as a society and a digital age.
That's what Gustav and I talk about among many other topics, including our shared appreciation
of the movie True Romance, in my my view one of the great movies of all
time.
This is the Artificial Intelligence Podcast.
If you enjoy it, subscribe on YouTube, give it 5 stars and iTunes, support it on Patreon,
or simply connect with me on Twitter at Lex Friedman, spelled F-R-I-D-M-A-N.
And now here's my conversation with Gustav Saurstrom.
Spotify has over 50 million songs in its catalog, so let me ask the all-important question.
I feel like you're the right person to ask, what is the definitive greatest song of all
time?
Ed Varys for me personally.
So you can't speak definitively for everyone.
I wouldn't believe very much in machine learning if I did, right?
Because everyone had the same taste.
So for you, what is... you have to pick? What is the song?
It's pretty easy for me. There is this song called, You're So Cool, Hans Simmer,
soundtrack to True Romance. It was a movie that made a big impression on me,
and it's kind of been following me through my life.
Actually, how did I play out my wedding?
I sat with the organist and helped him play it on an organ,
which was a pretty interesting experience.
That is probably my, I would say, top three movie of all time.
Yeah, this is an incredible movie. And it came out during my formative years and as I've discovered in music,
you shape your music taste during those years, so it definitely affected me quite a bit.
Did it affect you in any other kind of way?
Well, the movie itself affected me back then. It was a big part of culture.
I didn't really adopt any characters from the movie, but it was a great story of love, fantastic actors.
And really, I didn't even know who Hans Zimmer was at the time,
but fantastic music.
And so that song has followed me.
And the movie actually has followed me throughout my life.
That was Quintam and Turntino, actually, I think,
director or producer. So it's not stairway to heaven or Bohemian rap city it's those are
those are great they're not my personal favorites but I've realized that
people have different tastes and that's that's a big part of what we do well for
me I don't have to stick with stairway to heaven so 35,000 years ago I looked
at something on Wikipedia.
Fluid-like instruments started being used in caves as part of hunting rituals, and primitive
cultural gatherings, things like that.
This is the birth of music.
Since then, we had a few folks, Beethoven, Elvis, Beatles, Justin Bieber, of course, Drake.
So in your view, let's start like high level philosophical.
What is the purpose of music on this planet of ours?
I think music has many different purposes.
I think there's certainly a big purpose, which
is the same as much of entertainment, which is escapism,
and to be able to live in some sort of other
mental state for a while.
But I also think you have the opposite of escaping, which is to help you focus on something
you are actually doing.
I think people use music as a tool to tune the brain, to the activities that they are actually
doing. And it's kind of like, in one sense,
maybe it's the rawest signal.
If you think about the brain as neural networks,
it's maybe the most efficient hack we can do
to actually actively tune it into some state that you want to be.
You can do it in other ways.
You can tell stories to put people in a certain mood.
But music is probably very effective
to get you to a certain mood very fast. You know, there's a social component historically to music where people listen to music together.
I was just thinking about this that to me, and you mentioned machine learning, but to me,
personally, music is a really private thing.
Like I'm speaking for myself, I listen to music, like almost nobody knows the
kind of things I have in my library, except people who are really close to me and they
really only know a certain percentage. There's like some weird stuff that I'm almost
probably embarrassed by, right?
It's called the Guilty Pleasures, right? Everyone has to have the Guilty Pleasures yet.
Hopefully, they're not too bad, but it's, so it's's a, for me, it's personal. Do you think
of music as something that's social or is something that's personal? It is a very.
So, I think it's the same, it's the same answer that you use it for both. We've thought a lot about
this during these 10 years at Spotify, obviously. In one sense, as you said, music
is incredibly social. You go to concerts and so forth. On the other hand, it is your escape
and everyone has these things that are very personal to them. So what we found is that
when it comes to most people claim that they have a friend or
two that they are heavily inspired by and that they listen to.
I actually think music is very social, but in a smaller group setting, it's an intimate
form of relationship.
It's not something that you necessarily share broadly.
Now at concerts, you can argue you do, but then you've gathered a lot of people
that you have something in common with.
I think this broadcast sharing on music
is something we tried on social networks and so forth,
but it turns out that people aren't super interested
in what their friends listen to.
They're interested in understanding
them to have something in common, perhaps, with a friend, but not just as
information. Right, that's really interesting. I was just thinking
at this morning, was seeing this Spotify. I really have a pretty intimate
relationship with Spotify with my playlists, right?
I've had them for many years now
and they've grown with me together.
There's an intimate relationship you have
with a library of music that you've developed
and we'll talk about different ways of play with that.
Can you do the impossible task and try to give a history
of music listening from your perspective
from before the internet and after the internet and just kind of everything leading up to streaming
Spotify. I'll try. It could be a 100 year podcast. I'll try to do a brief version. There
are some things that I think are very interesting during the history of music, which is that before recorded
music, to be able to enjoy music, you actually had to be
where the music was produced, because it couldn't record it
and time shifted.
Creation and consumption had to happen at the same time,
basically concerts.
And so you either had to get to the nearest village
to listen to music.
And while that was cumbersome,
and it severely limited the distribution of music,
it also had some different qualities,
which was that the creator could always interact
with the audience.
It was always live.
And also, there was no time cap on the music.
So I think it's not a coincidence
that these early classical works,
they're much longer than the three minutes.
The three minutes came in as a restriction of the first wax disk that could only contain
a three-minute song on one side, right?
So actually the recorded music severely limited the, or put constraints, I won't say limit.
I mean, constraints are often good, but it put very hard constraints on the music format.
So you kind of said, like instead of doing this opus, like many tens of minutes or something, now you get three
and a half minutes, because then you're out of wax on this disk. But in return, you get
in amazing distribution, you'll reach, well, well, why didn't write it?
Just on that point real quick, without the mass scale distribution, there's a scarcity component where you kind of look forward to it.
We had that, it's like the Netflix versus HBO Game of Thrones.
You wait for the event because you can't really listen to it.
You look forward to it and then it's,
you derive perhaps more pleasure because it's more rare for you
to listen to particular piece.
You think there's value to that scarcity?
Yeah, I think that that is definitely a thing and there's always this component of if you
have something in infinite amounts, will you value it as much?
Probably not.
Humanity is always seeking some, is relative.
So you're always seeking something you didn't have
and when you have it, you don't want to push it as much.
So I think that's probably true,
but I think that's why concerts exist.
So you can actually have both.
But I think net, if you couldn't listen to music
in your car driving, that'd be worse.
That cost will be bigger than the benefit
of the anticipation, I think, that you would have.
So, yeah, it started with live concerts, then it's being able to, you know, the phonograph
invented, right?
You start to be able to record music.
Exactly.
Then you got this massive distribution that made it possible to create two things, I think,
first of all, cultural phenomenons that probably need distribution to be able to happen.
But it also opened access for a new kind of artist.
So you started to have these phenomenons like Beatles
and Elvis and so forth.
That would really a functional distribution, I think,
obviously, of talent and innovation,
but there was also technical component.
And of course, the next big innovation
to come along was radio, broadcast radio.
And I think radio is interesting because it started not
as a music medium, it started as an information medium
for news.
And then radio needed to find something
to fill the time with so that they could honestly play more ads
and make more money.
And music was free
So so then you had this massive distribution. We could program to people. I think those things that ecosystem
is what what created the ability for for hits
But it was also a very broadcast medium
So you would tend to get these massive massive hits, but maybe not such a long tail
In terms of choice of everybody listening to the same stuff.
Yeah, and as you said, I think there are some social benefits to that.
I think, for example, there is a high statistical chance that if I talk about the
latest episode of Game of Thrones, we have something to talk about just statistically
in the age of individual choice, maybe some of that goes away.
So I, I do see the value of like shared cultural components,
but also obviously love personalization.
And so let's catch this up to the internet.
So maybe Napster, well, first of all, there's like MP3s,
those exact tape CDs.
There was a digitalization of music with a CD, really.
It was physical distribution, but the music became digital.
And so they were files, but basically boxed software to use a software analogy.
And then you could start downloading these files.
And I think there are two interesting things that happen back to music used to be longer
before it was constrained by the distribution medium.
I don't think that was a coincidence.
And then really the only music genre to have developed mostly after music was a file
again on the internet is EDM.
And EDM is often much longer than the traditional music.
I think it's interesting to think about the fact that music is no longer constrained in
minutes per song or something. It's a legacy of how an old distribution technology.
And you see some of this new music that breaks the format, not so much as I would have expected
actually by now, but it still happens.
So first of all, I don't really know what EDM is.
Electronic dance music.
Yeah.
You could say Avi G was one of the biggest in this genre.
So the main constraint is of time, something that is three, four, five minutes song.
You get that song, there were eight minutes, ten minutes and so forth.
Because it started as a digital product that you downloaded, so you didn't have this constraint anymore.
So I think it's something really interesting that I don't think has fully happened yet, kind of jumping ahead a little bit to where we are, but I
think there's tons of former innovation in music that should happen now, that
couldn't happen when you needed to really adhere to the distribution constraints.
If you did not hear to that, you would get no distribution. So Gjerk, for example, a Icelandic artist,
she made a high-pad app as an album.
That's very expensive.
Even though the app has great distribution,
she gets nowhere near the distribution
versus staying within the three-minute format.
So I think now that music is fully digital
inside these streaming services,
there is the opportunity to change the format again.
And allow creators to be much more creative
without limiting their distribution ability.
That's interesting that you're right.
It's surprising that we don't see that
taking advantage more often.
It's almost like the constraints of the distribution
from the 50s and 60s have molded the culture to where we want the five,
three to five minutes on, that anything else, not just, so we want the song as consumers
and as artists, like, because I write a lot of music and I never even thought about writing
something longer than 10 minutes.
It's really interesting that those constraints be- Because all your training data has been three and a half minutes on straight. It's really interesting that those constraints,
because all your training data has been three
and a half minutes on, right?
It's right.
OK, so digitization of data led to MP3s.
Yeah, so I think you had this file then
that was distributed physically.
But then you had the components of digital distribution.
And then the internet happened. And there was this vacuum where you had a format
that could be digitally shipped, but there was no business model.
And then all these pirate networks happened,
Napster and in Sweden, Pirate Bay, which was one of the biggest.
And I think from a consumer point of view,
which leads up to the inception of Spotify, from a consumer point of view, which leads up to the inception of Spotify, from
a consumer point of view, consumers for the first time had this access model to music,
where they could, without any marginal cost, they could try different tracks.
You could use music in new ways.
There was no marginal cost.
And that was a fantastic consumer experience. I have access to all the music I ever made.
I think was fantastic. But it was also horrible for artists because there was no business
model around it. So they didn't make any money. So the user need almost drove the user
interface before there was a business model. And then there were these download stores
the user interface before there was a business model. And then there were these download stores
that allowed you to download files,
which was a solution,
but it didn't solve the access problem.
There was still a marginal cost of 99 cents
to try one more track.
And I think that heavily limits how you listen to music.
The example I always give is,
in Spotify, a huge amount of people listen to music
while they sleep, while they
go to sleep and while they sleep.
If that costed you 99 cents per three minutes, you probably wouldn't do that.
And you would be much less adventurous if there was a real dollar cost to explore music.
So the access model is interesting and that it changes your music behavior.
You can be, you can take much more risk because there's no marginal cost to it. Maybe let me linger on piracy for a second because I find, especially coming from Russia, piracy is something that's very interesting.
To me, not me, of course, ever, but I have friends who have part-token piracy of music, software, TV shows, sporting events. And usually to me,
what that shows is not that they can actually pay the money, and they're not trying to save money.
They're choosing the best experience. So what to me, piracy shows is a business opportunity in all these domains.
And that's where I think you're right. Spotify stepped in is basically, piracy was
an experience. You can explore with fine music you like. And actually the interface of piracy
isn't as horrible because it's, I mean, it's that metadata, yeah, metadata, long download times, all kinds of stuff.
And what Spotify does is basically, first rewards,
artists and second makes the experience
of explore music much better.
I mean, the same is true, I think, for movies and so on.
The piracy reveals, in the software space, for example,
I'm a huge user and fan of Adobe
products. And there was much more incentive to pirate Adobe products before they went to
a monthly subscription plan. And now all of the said friends that used to pirate Adobe
products that I know now actually pay gladly for the monthly subscription.
Yeah, I think you're right.
I think it's a sign of an opportunity for product development
and that sometimes there's a product market fit
before there's a business model fit in product development.
I think that's a sign of it.
In Sweden, I think it was a bit of both.
There was a culture where we even had a political
party called a pirate party. And this was during the time when people said that information
should be free. It somehow was wrong to charge for ones and zeroes. So I think people felt
that artists should probably make money somehow else and you know, concerted something.
So at least in Sweden it was part
really social acceptance even at the political level. But that also forced Spotify to compete
with free, which I don't think would actually could have happened anywhere else in the world.
The music industry needed to be doing bad enough to take that risk. And Sweden was like the perfect
testing ground. It had government-funded
high-band with low latency broadband, which meant that the product would work, and it was also
there was no music revenue anyway. So they were kind of like, I don't think this is going to work,
but why not? So this product is one that I don't think could have happened in America,
there was a large music market, for example. So how do you compete with free? Because that's an interesting world of the internet
where most people don't like to pay for things.
So Spotify steps in and tries to be asked
compete with free.
How do you do it?
So I think two things.
One is people are starting to pay for things on the internet.
I think one way to think about it was that advertising
was the first business model because no one would put a credit card on the internet. Transactional
with Amazon was the second and maybe subscription is the third and if you look offline, subscription
is the biggest of those. So that may still happen. I think people are starting to pay but definitely
back then we needed to compete with free and the first thing you need to do is obviously
to lower the price to free and then you need to be better somehow. And the first thing you need to do is obviously to lower the price to free.
And then you need to be better, somehow.
And the way that Spotify was better
was on the user experience, on the actual performance,
the latency of, you know, even if you had
high bandwidth broadband, it would still take you 30 seconds to a minute to download one of these tracks.
So the Spotify experience of starting within the perceptual limit of immediacy about 250 milliseconds
meant that the whole trick was it felt as if you had downloaded all the part, it was on your hard drive.
It was that fast, even though it wasn't. And it was still free, but somehow you were actually still being a legal
citizen. That was the trick that's what if I managed to pull off. So I've actually heard you say
this or write this and that was a surprise that wasn't aware of it because I just took it for granted.
You know, whenever an awesome thing comes along, you just like, oh, of course it has to be this way.
That's exactly right.
That it felt like the entire world's libraries at my fingertips because of that,
of that latency being reduced.
What was the technical challenge introducing the latest?
So there was a group of really, really talented engineers,
one of them called Ludwig Stregius.
He wrote the, actually from Gothenburg, he wrote the initial, the
U-Torn client, which is kind of an interesting backstory to Spotify, you know, that we have
one of the top developers from, from BitTorn clients as well. So he wrote U-Torn, the world's
smallest BitTorn clients. And then he, he was acquired very early by Daniel and Martin,
who founded Spotify.
And they actually sold the U-turn client to Betoant,
but kept living.
So Spotify had a lot of experience
within peer-to-peer networking.
So the original innovation was a distribution innovation,
where Spotify built an end-to-end media distribution system.
Up until only a few years ago, we actually hosted all the music ourselves.
So we had both the service side and the client.
And that meant that we could do things such as having a peer-to-peer solution to use local
caching on the client side, because back then the world was mostly desktop.
But we could also do things like hack the TCP protocols, things like Nagle's algorithm
for kind of exponential back off,
or ramp up and just go full throttle and optimize for latency
at the cost of bandwidth.
And all of this end to end control,
meant that we could do an experience that felt like a step change.
These days, we actually are on GCP.
We don't host our own stuff, and everyone is really fast these days.
So that was the initial competitive advantage, but then obviously you have to move on over
time.
And that was over 10 years ago, right?
That was in 2008, the product was launched in Sweden, it was in a beta, I think, 2007.
And it was on the desktop, right?
It was desktop only.
There's no phone.
There was no phone.
The iPhone came out in 2008, but the app store came out one
year later, I think. So the writing was on the wall, but there was no phone yet.
You've mentioned that people would use Spotify to discover the songs they like, and then they would
torrent those songs so they can copy it to their phone, just hilarious. I'm not torn, Pirate.
Seriously, Piracy does seem to be like a good guide
for business models video content.
As far as I know, Spotify doesn't have video content.
Well, we do have music videos and we do have videos on the service,
but the way we think about ourselves is that we're an audio service.
And we think that if you look at the amount of time that people spend on audio, it's actually
very similar to the amount of time that people spend on video.
So the opportunity should be equally big, but today is not at all valued, video is valued
much higher.
So we think it's basically completely under-value.
We think of ourselves
as an audio service, but within that audio service, I think video can make a lot of sense.
I think for when you're discovering an artist, you probably do want to see them and understand
who they are to understand their identity. You won't see that video every time. No, 90%
of the time the phone is going to be in your pocket. For podcasters, you use video. I think
that can make a ton of sense. So we do have video, but we're in audio service where think of it as we call it internally
back-roundable video. Video that is helpful, but isn't the driver of the narrative.
I think also if you look at YouTube, the way people, there's quite a few folks who listen
to music on YouTube. So in some sense,
YouTube is a bit of a competitor to Spotify, which is very strange to me that people use
YouTube to listen to music. They play essentially the music videos, right? But don't watch the videos
and put in their pocket.
Well, I think it's similar to what's strangely, maybe it's similar to what we were for the
Pires in Networks, where you too, for historical reasons, have a lot of music videos.
So people use YouTube for a lot of the discovery part of the process, I think.
But then it's not a really good sort of quote-unquote MP3 player, because it doesn't even background. Then you have to keep the app, I think. But then it's not a really good, sort of, quote-unquote, MP3 player, because it doesn't even background.
Then you have to keep the app in the foreground.
So it's not a good consumption tool,
but it's a decent, really good discovery.
I mean, I think you're a bit of a fantastic product.
And I use it for whole content purposes, education.
That's true.
If I were to admit something, I do use YouTube a little bit
for the discovery, to assist in the discovery process of songs.
And then if I like it, I'll add it to Spotify.
But that's okay.
That's okay with us.
Okay, so sorry, we're jumping around a little bit.
So this kind of incredible, you look at Napster, you look at the early days of Spotify.
How do you, one fascinating point is, how do you grow a user base?
So you learn in Sweden, you have an idea.
I saw the initial sketches that looked terrible.
How do you grow user base from a few folks to millions?
I think there are a bunch of tactical answers.
So first of all, I think you need a great product.
I don't think you take a bad product
and market it to be successful. So you need a great product. But don't think you take a bad product and market it to be successful.
So you need a great product.
But sorry to interrupt, but it's a totally new way to listen to music too. So it's not just
that people realize immediately that Spotify is a great product.
I think they did. So back to the point of piracy, it was a totally new way to listen to music
legally, but people had been used to the access model in Sweden and
the rest of the world for a long time through piracy.
So one way to think about Spotify, it was just legal and fast piracy.
And so people have been using it for a long time.
So they weren't alien to it.
They didn't really understand how it could be legal because it was seemed too fast and too
good to be true, which I think is a great product proposition if you can be too good to
be true.
But what I saw again and again was people showing each other,
clicking the song, showing how fast it started and saying,
I can't believe this.
So I really think it was about speed.
Then we also had an invite program that was really meant for scaling
because we hosted our own service.
We needed a control scaling.
But that built a lot of expectation,
and I don't wanna say hype,
because hype implies that it wasn't true.
Explicitement.
Excitement around the product.
And we replicated that when we launched in the US,
we also built up and it might only program first.
There are lots of tactics,
but I think you need a great product
that solves some problem.
And basically, the key innovation, lots of tactics, but I think you need a great product to solve some problem and
basically the key innovation, there was technology, but on a metal level the
innovation was really the access model versus the ownership model. And that was
tricky. A lot of people said that they wanted to own their music, they would
never kind of rent it or borrow it, But I think the fact that we had a free tear,
which meant that you get to keep this music for life
as well helped quite a lot.
So this is an interesting psychological point.
It may be you can speak to.
It was a big shift for me.
Like it's almost like a, like to go to therapy for this.
Is I think I would describe my early listening experience and
I think a lot of my friends do is basically hoarding music.
Is you're like slowly one song by one song or maybe albums gathering a collection of
music that you love and you own it.
It's like often especially with CDs or tape you like physically had it.
And then what Spotify what I had to come to grips with,
it was kind of liberating actually,
is to throw away all the music.
I've had this therapy session with lots of people.
And I think the mental trick is,
so actually we've seen the user data when Spotify started.
A lot of people did the exact same thing.
They started hoarding as if the music would disappear,
right?
Almost equivalent of downloading.
And so, you know, we had these playlists that had limits
of like, if you 100,000 tracks,
if you know what we'll ever, like, well, they do.
Nuts and hundreds and hundreds of thousands of tracks.
And to this day, you know, some people
want to actually save, quote unquote,
and play the entire catalog.
But I think the therapy session
goes something like, instead of throwing away your music,
if you took your files and you stored them in a locker
at Google, it'd be a streaming service.
It's just that in that locker, you have all the world's
music now for free.
So instead of giving away your music,
you got all the music.
It's yours.
It's a, you could think of it as having a copy of the world's catalogue there forever.
So you actually got more music instead of less.
It's just that you just took that hard disk and you sent it to someone who stored it for you.
And once you go through that mental journey of like still my files, they're just over there
and I just have 40 million or 50 million or something now.
Then people are like, okay, that's good. The problem is I think
Because you paid us a subscription
If we hadn't had the free tier where you would feel like even if I don't want to pay anymore
I still get to keep them you keep your playlists forever. They don't disappear even though you stop paying
I think that was really important if we would have started as
You know you can put in all this time, but if you stop paying, you lose all your work, I think that would have been a big challenge and was the big challenge
for a lot of our competitors.
That's another reason why I think the free tier is really important, that people need to
feel the security that the work they put in, it will never disappear, even if they decide
not to pay.
I like it, I put the work, I put in, I actually stopped even thinking about that way.
I just actually Spotify taught me to just enjoy music.
As opposed to what I was doing before, which is like in an unhealthy way hoarding music.
Which I found that because I was doing that, I was listening to a small selection of songs way too much,
to where I was getting sick of them. Whereas Spotify is the more
liberating kind of approach as I was just enjoying. Of course, I listened to Stairwear
to Heaven over and over, but because of the extra variety, I don't get as sick of them.
There's an interesting statistic I saw. So Spotify has maybe you can correct me, but over 50 million songs, tracks, and over 3 billion playlists.
So 50 million songs and 3 billion playlists.
60 times more playlists songs.
What do you make of that?
Yeah, so the way I think about it is that from a, from a, from a statistician or machine
learning point of view, you have all these, if you want to think about reinforcement learning,
where you have this state space of all the tracks,
and you can take different journeys through this, through this world.
And these, I think of these as like people helping themselves and each other,
creating interesting vectors through this space of tracks.
And then it's not so surprising that across many tens of millions of kind of atomic units,
there will be billions of paths that make sense.
And we're probably pretty quite far away from having found all of them.
So kind of our job now is users, when Spotify started, it was really a search box that
was for the time pretty powerful,
and then I'd like to refer to this programming language called Playlisting, where if you,
as you probably were pretty good at music, you knew your new releases, you knew your back-out-log,
you knew your start with the heaven, you could create a soundtrack for yourself using this
Playlisting tool that's like met the programming language for music to soundtrack your life.
People who were good at music, it's back to how do you met the programming language for music, to soundtrack your life. And people who were good at music,
it's back to how do you scale the product.
For people who were good at music,
that was actually enough.
If you had the catalog in a good search tool,
you can create your own sessions,
you could create really good soundtrack for your entire life.
Probably perfectly personalized because you did it yourself.
But the problem was, most people,
many people aren't that good at music.
They just can't spend the time.
Even if you're very good at music,
it's gonna be hard to keep up.
So what we did to try to scale this
was to essentially try to build,
you can think of them as agents that,
this friend that some people had
that helped them navigate this music catalog,
that's what we're trying to do for you.
But also there is something like 200 million active users on Spotify.
So there's okay.
So from the machine learning perspective, you have these 200 million people plus.
They're creating, it's really interesting to think of playlists as, I mean, I don't know
if you meant it that way, but it's almost like a programming language.
It's a release, a trace of exploration of those individual agents, the listeners.
And you have all this new tracks coming in.
So it's a fascinating space that is ripe for machine learning.
So how can playlists be used as data in terms of machine learning
and just to help Spotify organize the music?
So we found in our data not surprising that people who play listed lots, they retain
much better, they had a great experience.
And so our first attempt was to play list for users.
And so we acquired this company called Tunigo of editors and professional playlisters
and kind of leveraged the maximum of human intelligence to help build
kind of these vectors through the track space for people.
And that brought in the product.
Then the obvious next, and we used statistical means
where they could see when they created a playlist,
how did that playlist perform?
They could see skips of the songs,
they could see how the songs perform,
and they manually iterated the playlist to maximize performance for a large group of people.
But there were never enough editors to play lists for you personally.
So the promos on machine learning was to go from kind of looked at these playlists and we had some people in the company, a person named Eric Van Van van van van van van van back in back then, in like 2007, 2008, back then
it was mostly collaborative filtering and so forth, but we realized that what this is,
is people are grouping tracks for themselves that have some semantic meaning to them and
then they actually label it with a playlist name as well.
So in a sense, people were grouping tracks along semantic dimensions and labeling them.
And so could you use that information to find that latent embedding?
And so we started playing around with collaborative filtering and we saw tremendous success with
it.
Basically trying to extract some of these dimensions.
And if you think about it's not surprising at all,
it would be quite surprising if playlists were actually random
if they had no semantic meaning.
For most people, they grouped these tracks for some reason.
So we just happen across this incredible data set
where people have taken these tens of millions of tracks
and grouped them along different semantic vectors.
And the semantics being outside the individual users, that's some kind of universal.
There's a universal embedding that holds across people on the surface.
Yes, I do think that the embeddings you find are going to be reflective of the people who play listed.
So, if you have a lot of indie lovers who play lists, your embeddings are going to perform
better there.
But what we found was that, yes, there were these latent similarities.
They were very powerful.
And we had, it was interesting because I think that the people who play this are the most initially
were the so-called music of physiognados who really into music and they often had a certain
their taste of certain gear towards a certain type of music.
And so what surprised us, if you look at the problem from the outside, you might expect
that the algorithms would start performing best with mainstreamers first
because it somehow feels like an easier problem to solve mainstream taste.
Then really particular taste was a complete opposite for us.
The recommendations performed fantastically for people who saw themselves as having very
unique taste.
That's probably because all of them play less.
And they didn't perform so well for mainstreamers.
They actually thought they were a bit too particular and unorthodox. So we had a complete opposite of what we expected. Success within the
hardest problem first and then had to try to scale to more mainstream recommendations.
So you've also acquired
Econests that analyze a song data. So in your view, maybe you can talk about,
so what kind of data is there from a machine learning
perspective?
There's a huge amount, we're talking about playlist thing,
and just user data of what people are listening to,
the playlist they're constructing, and so on.
And then there is the actual data within a song,
what makes a song, I don't know, the actual waveforms.
How do you mix the two, how much values are in each?
To me, it seems like user data is,
well, it's a romantic notion that the song itself
will contain useful information,
but if I were to guess user data would be much more powerful.
Like Playlist would be much more powerful.
Yeah, so we use both. Our biggest success initially was with Playlist data without understanding
anything about the structure of the song, but when we acquired the Econus, they had the inverse problem.
They actually didn't have any play data. They were a provider
of recommendations, but they didn't actually have any play data. So they looked at the
structural songs sonically, and they looked at Wikipedia for cultural references and so
forth, right? And they did a lot of NLU and so forth. So we got that skill into the company
and combined our user data with their content-based.
So you can think of it as we were user-based and they were content-based in their recommendations.
We combine those two.
For some cases where you have a new song that has no play data, obviously you have to
try to go by either the artist's or the sonic information in the song
or what it's similar to.
So there's definitely a value in both,
and we do a lot in both, but I would say yes.
The user data captures things that have to do
with culture in the greater society
that you would never see in the content itself.
But that said, we have seen, we have a research lab
in Paris.
We can talk about more about that on machine learning
on the creator side.
What it can do for creators, not just for the consumers.
But what we looked at, how does the structure of a song actually
affect the listening behavior?
And it turns out that we can predict things like skips,
based on the song itself,
we could say that maybe you should move that chorus a bit
because your skip is gonna go up here.
There is a lot of latent structure in the music,
which is not surprising,
because it is some sort of mind hack.
So there should be structure,
that's probably what we respond to.
We just blew my mind actually
from the creator perspective.
So that's a really interesting topic
that probably most creators aren't taking
advantage of. Right. So there's a recently got to interact with a few folks, YouTubers,
who are like obsessed with this idea of what do I do to make sure people keep watching
the video. And they like look at the
analytics of which point do people turn it off and so on. First of all, don't
think that's healthy but it's because you can do it a little too much but it is a
really powerful tool for helping the creative process. You just made me
realize you could do the same thing for creation of music. Is that something you've looked into?
Is it, can you speak to how much opportunity there is for that thing?
Yeah, I listened to the podcast with Zerosh and I thought it was fantastic and I reacted
to the same thing where he said he posted something in the morning, immediately watched
the feedback where the drop-off was and then responded to that in the morning, immediately watched the feedback where the drop off was and then responded to that in the afternoon, which is quite different from how people make podcasts
for example.
Yes, exactly.
I mean, the feedback loop is almost non-existent.
So if we back out one level, I think actually both for music and podcasts, which we also
do is at Spotify, I think there's a tremendous opportunity just for the creation workflow.
And I think it's really interesting speaking to you because you're a musician, a developer,
and a podcaster. If you think about those three different roles, if you make the leap as
a musician, if you think about it as a software tool chain, really, your door with the stems,
that's the IDE, right? That's
where you work in source code format with your, with what you're creating. Then you sit
around and you play with that, and when you're happy, you compile that thing into some sort
of, you know, ACR, MP3 or something. You do that because you get distribution. There
are so many run times for that MP3 across the world, and car stairs and stuff. So if you
kind of compile this executable, you you ship it out, and kind of
hold fashion box software analogy.
And then you hope for the best, right?
But as a software developer, you would never do that.
First, you go and get how you collaborate with other creators.
And then you think it'd be crazy to just ship one version
of your software without doing an A, B test without any feedback loop and then issue tracking.
Exactly.
And then you would look at the feedback loops and try to optimize that thing.
So I think if you think about it as a very specific software tool chain, it looks quite
arcane.
The tools that a music creator has versus what a software developer has.
So that's kind of how we think about it.
And why wouldn't a music creator have something like GitHub?
You could collaborate much more easily.
So we bought this company called SoundTrap, which
has a kind of Google Docs for Music approach, where
you can collaborate with other people on the source code format
with stamps.
And I think introducing things like AI tools there to help you
ask your creating music, both in helping you put a compliment
to your music like drums or something, help you master and mix
automatically, help you understand how this track will perform.
Exactly what you would expect as a software developer, I think makes a lot of sense.
And I think the same goes for a podcaster.
I think podcasters will expect to have the same kind of feedback loop that Ciroch has.
Like, why wouldn't you?
Maybe it's not healthy, but...
Sorry, I wanted to criticize the fact because you can overdo it.
Because a lot of the
and we're in a new era of that. So you can become addicted to it and therefore
what people say you become a slave to the YouTube algorithm. Or sort of it's always a danger of
a new technology as opposed to say if you're creating a song,
becoming too obsessed about the intro riff to the song that keeps people listening
versus actually the entirety of the creation process.
Yeah, it's a balance.
Absolutely.
But the fact that there's zero, I mean, you're blowing my mind right now because you're
completely right that there's no signal whatsoever.
There's no feedback whatsoever in the creation process and music or podcasting.
Almost at all.
And are you saying that Spotify is hoping to help create tools to not tools, but
to tools actually.
Actually tools for creators.
Absolutely. So we have, we've made some acquisitions
the last few years around Music creation.
It's called SoundTrap, which is a dual audio workstation,
but that is browser-based.
And their focus was really the Google Docs approach.
We can collaborate with people much more easily
than you could in previous tools.
So we have some of these tools that we're working with
that we want to make accessible,
and then we can connect it with our consumption data.
We can create this feedback loop where we could help you understand,
we could help you create and help you understand how you will perform.
We also acquired this other company with
a podcasting called Anchor,
which is one of the biggest podcasting tools,
mobile focused, so really focused on simple creation,
or easy access to creation.
But that also gives us this feedback loop.
And even before that, we invested in something
called Spotify for Artists and Spotify for Podcasters, which
is an app that you can download, you can verify that you
are that creator.
And then you get things that software developers
have had for years.
You can see where, if you look at your podcast,
for example, on Spotify, or some of the you released,
you can see how it's performing, which
citizen is performing, and who's listening to it,
what's the demographic break up.
So similar in the sense that you can understand
how you're actually doing on the platform.
So we definitely want to build tools. I think you also interviewed the
head of research for Adobe. And I think that's in, that's in back to Photoshop that you like.
I think that's an interesting analogy as well. Photoshop I think has been very innovative
in helping photographers and artists.
And I think there should be the same kind of tools for music creators, where you could
get AI assistants, for example, as you're creating music, as you can do with Adobe, where
you can, I want to sky over here and you can get help creating that sky.
The really fascinating thing is what Adobe doesn't have is a distribution for the content
you create.
So you don't have the data if I create, whatever creation I make in Photoshop or Premiere,
I can't get immediate feedback.
I can on YouTube, for example, about the way people are responding. And if Spotify is creating those tools that, that's a, it's a really exciting
actually world. But let's talk a little about podcasts. It's, so I have trouble
talking to one person. So it's a bit terrifying and kind of hard to fathom,
but an average 60 to 100,000 people will listen to this episode.
Okay.
It's intimidating.
It's intimidating.
I host that I'm Blueberry.
I don't know if I'm pronouncing that correctly, actually.
It looks like most people listen to it on Apple podcasts, cast box, and pocket gas.
And only about a thousand.
Listen on Spotify.
It just my podcast, right?
So where do you see a time when Spotify will dominate this?
So Spotify is relatively new into this podcasting. Podcasting.
Sorry.
Yeah.
In podcasting.
What's the deal with podcasting and Spotify? How serious is Spotify about podcasting. Yeah, in podcasting. What's the deal with podcasting and Spotify?
How serious is Spotify about podcasting?
Do you see a time where everybody would listen to,
you know, probably a huge amount of people,
majority perhaps listen to music on Spotify?
Do you see a time when the same is true for podcasting?
Well, I certainly hope so.
That is our mission.
Our mission as a company is actually
to enable a million creators to live off of their art,
and a billion people inspired by it.
And what I think is interesting about that mission
is it actually puts the creators first,
even though it's not as a consumer focused company,
and it says to be able to live off of their art,
not just make some money off of their art as well.
So it's quite an ambitious project.
And so, we think about creators of all kinds, and we kind of expanded our mission from being
music to being audio a while back.
And that's not so much because we think we made that decision. We think that decision was made for us. We think the we made that decision.
We think that decision was made for us.
We think the world made that decision.
Whether we like it or not,
when you put in your headphones,
you're gonna make a choice between music
and a new episode of your podcast or something else.
We're in that world whether we like it or not.
And that's how radio work.
So we decided that we think it's about audio.
You can see the rise of audio books and so forth.
We think audio is this great opportunity.
So we decided to enter it.
And obviously Apple and Apple Podcasts
is absolutely dominating in podcasting.
And we didn't have a single podcast only like two years ago.
What we did though was we looked at this and said,
you know, can we bring something to this?
You know, we want to do this, but back to the original Spotify,
we have to do something that consumers actually value to be able to do this.
And the reason we've gone from not existing at all to being the
the quite a wide margin,
the second largest podcast consumption,
still wide gap to iTunes, but we're growing quite fast.
I think it's because when we looked at the consumer problem,
people said surprisingly that they wanted their podcasts
and music in the same application.
So what we did was we took a little bit of a different approach
where we said, instead of building a separate podcast app,
we thought, is there a consumer problem to solve here?
Because the others are very successful already.
And we thought there was in making a more seamless experience
where you can have your podcast and your music
in the same application.
Because we think it's audio to you.
And that has been successful and
that meant that we actually had 200 million people to offer this to instead of starting
from zero.
So I think we have a good chance because we're taking a different approach than the competition
and back to the other thing I mentioned about creators because we're looking at the end
to end flow.
I think there's a tremendous amount of innovation to do around podcast as a format. When we have creation tools and
consumption, I think we could start improving what podcasting is. I mean,
podcast is this opaque, big like one, two hour file that you're streaming,
which it really doesn't make that much sense in 2019, that it's not interactive,
there's no feedback loops, nothing like that.
So I think if we're gonna win,
it's gonna have to be because we build a better product
for creators and for consumers.
So we'll see, but it's certainly our goal.
We have a long way to go.
Well, the creator's part is really exciting.
You already, you got me hooked there.
There's the only stats I have.
A blueberry just recently added the stats
of whether it's listen to the end
or not.
And that's like a huge improvement, but that's still nowhere to where you could possibly
go into statistics.
You just download this, Spotify, podcast, there's Oppen Verify, and then you'll know where
people dropped out in this episode.
Oh, wow, okay.
The moment I started talking, okay, I might be depressed by this. But okay, so one
one other question is the original Spotify for music and I have a question about podcasting in
this line is the idea of albums I have what did you, music of officianados, friends who are really big fans of music,
often really enjoy albums,
listening to entire albums of an artist.
Correct me if I'm wrong, but I feel like Spotify has helped
replace the idea of an album with playlists.
So you create your own albums.
It's kind of the way at least I have experienced music
and I really enjoy it that way.
One of the things that was missing in
podcasting for me, I don't know if it's missing,
I don't know. It's an open question for me.
But the way I listen to podcasts is the way I would listen to albums.
So I take Joe Rogan experience, and that's an album.
And I listen, you know, I like, I put that on, and I listen you know I like I put that on
and I listen one episode after the next and there's a sequence and so on. Is there
room for doing what you did from music or doing what Spotify did for music but
creating playlists sort of this kind of playlisting idea of breaking apart
from podcasting from individual podcast and creating kind of this kind of play-listing idea of breaking apart from podcasting from individual podcast and creating kind of
this interplay or to have you thought about that space. It's a great question. So I think in
In music you're right. Basically you bought an album. So it was like you bought a small catalog of like 10 tracks
Right, it was again. It was actually a lot of a lot of consumption
You think it's about what you like but it's based on the business model, right? So was again, it was actually a lot of a lot of consumption. You think it's about what
you like, but it's based on the business model. You paid for this 10 track service and then you
listened to that for a while. And then when everything was flat-priced, you tended to listen differently.
Now, so I think the album is still tremendously important. That's why we have it. And you can save
albums and so forth. And you have a huge amount of people who really listen, according to albums.
And I like that because it is a creator format.
You can tell a longer story over several tracks.
And so some people listen to just one track.
Some people actually want to hear that whole story.
Now, in podcast, I think it's different.
You can argue that podcasts might be more like shows on Netflix.
You have like a full season of narcos.
And you're probably not going to do like one episode of narcos and then one of house
or cards.
You know, there's a narrative there and you love the cast and you love these characters.
So I think people will, people love shows.
And I think they will, they will listen to those shows.
I do think you follow a bunch of shows at the same time.
So there's certainly an opportunity to bring you
the latest episode of whatever the five, six, 10 things
that you're into.
But I think people are going to listen to specific hosts
and love those hosts for a long time.
Because I think there is something different with podcasts
where this format of
the experience of the audience is actually sitting here right between us. Whereas if you
look at something on TV, the audio actually would come from, you would sit over there,
the audio would come to you from both of us as if you were watching, not as if you were
part of the conversation. So my experience is I've been listening to podcasts like yours
and Joe Roganess, I feel like I know all of these people, they have no idea who I am, part of the conversation. So my experience is having listened to podcasts like yours and
Joe Roganess, I feel like I know all of these people. They have no idea who I am, but I
feel like I've listened to so many hours and it's very different from me watching a TV
show or an interview. So I think you kind of fall in love with people and experience in
a different way. So I think shows and hosts are going to be very
very important. I don't think that's going to go away into some sort of thing where you
don't even know who you're listening to. I don't think that's going to happen. What I do
think is I think there's a tremendous discovery opportunity in podcasts because the chat
the log is growing quite quickly. And I think podcasts is only a few like 500, 600,000 shows right now.
If you look back to YouTube as another analogy for creators, no one really knows if you would
lift a lid on YouTube, but it's probably billions of episodes.
And so I think the podcast catalog would probably grow tremendously because the creation tools
are getting easier.
And then you're going to have this discovery opportunity that I think is really big.
So a lot of people tell me that they love their shows, but discovering podcasts kind of
suck.
It's really hard to get into new show.
They usually quite long.
It's a big time investment.
So I think there's plenty of opportunity in the discovery part.
Yeah, for sure.
A hundred percent.
And even the dumbest, there's so many low-hanging
fruit too. For example, just knowing what episode to listen to first to try out a
podcast. Exactly. Because most podcasts don't have an order to them. They can be
listened to out of order. sorry to say some are better than
others episodes. So some episodes of Joe Rogan are better than others and it's
nice to know which you should listen to to try it out and there's as far as I
know almost no information in terms of like upvotes on how good an episode is.
Exactly.
So I think part of the problem is,
it's kind of like music.
There isn't one answer.
People use music for different things.
And there's actually many different types of music.
There's workout music and there's classical piano music
and focus music and so forth.
I think the same with podcasts.
Some podcasts are sequential.
They're supposed to be listened to in order.
It's actually telling an narrative.
Some podcasts are one topic, kind of like yours,
but different guests, so you could jump in anywhere.
Some podcasts actually have completely different topics.
And for those podcasts, it might be that,
I want, we should recommend one episode,
because it's about AI from someone,
but then they talk about something
that you're not interested in the rest of the episode.
So I think, well, we're spending a lot of time on now.
It's just first understanding the domain
and creating kind of the knowledge graph
of how do these objects relate and how do people consume.
And I think we'll find that it's going to be different.
I'm excited.
It's a, you're the spotifies the first people I'm aware of that are trying to do this for
podcasting.
Podcasting has been like a wild west up until now.
It's been a very, we want to be very careful though because it's been a very good wild west.
I think it's this fragile ecosystem and we want to make sure that you don't barge in and say,
we're going to internetize this thing.
And you have to think about the creators.
You have to understand how they get distribution today.
Who listens to how they make money today?
Try to make sure that their business model works,
that they understand.
I think it's back to doing something
to improving their products, like feedback loops
and distribution.
So jumping back into terms of this fascinating world
of recommender system listening to music
and using machine learning to analyze things,
do you think it's better to what currently,
correct me if I'm wrong,
but currently Spotify lets people
pick what they listen to.
The most part, there's a discovery process, but you kind of organize playlists.
Is it better to let people pick what they listen to or recommend what they should listen
to?
Something like stations by Spotify that I saw that you're playing around with maybe you can tell me what's the status of that?
This is a Pandora style app that just kind of as opposed to you select the music you listen to it kind of
feeds you
Music you listen to what's the status of stations by Spotify? What's its future?
the store is Spotify as we have grown has been as we made it more accessible to different audiences.
And stations is another one of those where the question is
some people want to be very specific.
They actually want to start with them.
And right now that needs to be very easy to do.
And some people, or even the same person, at some point might say
I want to feel upbeat or I want to feel upbeat, or I want to feel happy,
or I want songs to sing in the car.
So they put in the information at a very different level,
and then we need to translate that into what that means musically.
So stations is a test to create like a consumption input vector
that is much simpler, where you can just tune in a little bit
and see if that increases the overall reach.
But we're trying to kind of serve the entire gamut of super advanced so-called music of
hisionados all the way to people who they love listening to music, but it's not their
number one priority in life, right?
They're not going to sit and follow every new release from every new artist.
They need to be able to influence music at a different level.
So we're trying, you can think of it as different products.
And I think when one of the interesting things to answer your question on, if it's better
to lift the user, choose or to play, I think the answer is the challenge when machine learning
kind of came along, there was a lot of thinking about what does
product development mean in a machine learning context.
People like Andrew Eng, for example,
when he went to Baidu, he started doing a lot of practical machine learning,
went from academia and he thought a lot about this.
He had this notion that a product manager,
a designer and they used to work around this wireframe,
describe what the product should look like, who some talk about when you're doing a chat manager or designer and they used to work around this wireframe, kind of describe what the product should look like, with something to talk about.
When you're doing like a chat bot or a playlist, how do you, what are you going to say?
Like, it should be good.
That's not a good product description.
So how do you, how do you do that?
And he came up with this notion that the test set is the new wireframe.
The job of the product manager is to source a good test set that is representative of what,
like, if you say like, I want to play the status, so on existing in the car.
Job of the product manager is going source like a good test set of what that means.
Then you can work with the engineering to have algorithms to try to produce that, right?
So we try to think a lot about how to structure product development for a machine learning
age.
And what we discovered was that a lot of it
is actually in the expectation.
And you can go two ways.
So let's say that if you set the expectation with the user,
that this is a discovery product, like Discover Weekly,
you're actually setting the expectation
that most of what we show you will not be relevant.
When you're in the discovery process,
you can accept that actually if you find one gem every Monday that you
totally love, you're probably going to be happy. Even though statistical meaning one out
of ten is terrible or one out of twenty is terrible. From a use point of view, because
the setting was discovered is fine. Can I start to interrupt real quick? I just actually
learned about Discover Weekly, which is a Spotify, I don't know.
It's a feature of Spotify that shows you cool songs.
To listen, I, maybe I can do issue tracking.
I couldn't find out my Spotify app.
It's in your library.
It's in the library.
It's in the list of life.
Because I was like, whoa, this is cool.
I didn't know this existed.
And I tried to find it.
But I'll show a team. I'll hand feed back to this is cool. I didn't know this existed. And I tried to find it. But I was.
OK.
I'll show it to you.
And feedback to our product teams.
Yeah.
There you go.
But yeah, so yeah, sorry.
Just to mention, the expectation there
is basically you're going to discover new songs.
Yeah, so then you can be quite adventurous in the recommendations
you do.
But if you're, but we have another product called Daily Mix,
which kind of implies that these are only
going to be your favorites.
So if you have one out of 10, that is good,
and nine out of 10 that doesn't work for you,
you're going to think it's a horrible product.
So actually a lot of the product development,
we learned over the years,
is about setting the right expectations.
So for Daily Mix, algorithmically,
we would pick among things that feel very safe in your taste space.
There's a couple of weekly, we go kind of wild because the expectation is, most of this is not gonna.
So a lot of that, a lot of times your question there, a lot of, should you let the user pick or not?
It depends. We have some products where the whole point is, I do use it in click play, put the phone in the pocket,
and it should be really good music for like an hour.
We have other products where you probably need to say like no, no, save, no, no, and it's
very interactive.
I see.
That makes sense.
And then the radio product, the station's product is one of these like click play, put
in the pocket for hours.
That's really interesting.
So you're thinking of different test sets for different users in trying to create products
that sort of optimize for those test sets
that represent the specific set of users.
Yes, I think one thing that I think is interesting is
we invested quite heavily in editorial
in people creating playlists
using statistical data,
and that was successful for us, and then we also invested in machine learning.
And for the longest time, within Spotify and within the rest of the industry,
there was always this narrative of humans versus the machine,
how to go versus editorial, and editors would say like, well,
if I had that data, if I could see or play listing history,
and I made a choice for you,
I would have made a better choice.
And they would have, because they're much smarter
than these algorithms.
Human is incredibly smart compared to our algorithms.
They can take culture into account and so forth.
The problem is that they can't make 200 million decisions
per hour for every user that logs in.
So the algorithm may be not as sophisticated,
but much more efficient.
So there was this contradiction.
But then a few years ago, we started focusing on this kind
of human in the loop thinking around machine learning.
And we actually coined an internal term for it called
Algotorial, the combination of algorithms and editors,
where if we take
a concrete example, you think of the editor, this paid expert that we have, there's really
good at something like, so hip hop, EDM, something, right?
There are two experts, no end one in the industry.
So they have all the cultural knowledge.
You think of them as the product manager, and you say that, let's say that you want to create a...
You think that there's a product need in the world
for something like songs to sing in the car,
or songs to sing in the shower.
I'm taking that example because it exists.
People love to scream songs in the car when they drive, right?
So you want to create that product.
Then you have this product manager who's a musical expert.
They come up with a concept.
I think this is a missing thing in humanity,
like a playlist called Song Sishing in the Car.
They create the framing, the image, the title,
and they create a test set.
Or they create a group of songs, like a few thousand songs
out of the catalog that they manually curate,
that are known songs that are great to sing in the car.
And they can take, like, through romance into account, they understand things that our algorithms do not at all.
So they have this huge set of tracks.
Then when we deliver that to you,
we look at your taste vectors and you get the 20 tracks that are songs to sing in the car in your taste.
So you have personalization and editorial input in the same process.
If that makes sense.
Yeah, it makes total sense.
And several questions around that this is like fascinating.
Okay, so first it is a little bit surprising to me that the world expert humans are outperforming machines
at specifying songs
a sing in the car.
So maybe you could talk to that a little bit.
I don't know if you can put it into words, but what is it?
How difficult is this problem of, do you really,
I guess what I'm trying to ask is there, how difficult is it to encode
the cultural references,
the context of the song, the artists, all those things together? Can machine learning really not do that?
I mean, I think machine learning is great at replicating patterns.
If you have the patterns, but if you try to write with me a speck what songs great to song to sing in the card definition is
Is it is it loud? There's a many courses to do the linen movies?
It's it quickly gets incredibly complicated, right?
Yeah, and and a lot of it may not be in the structure of the song or the title
It could be cultural references because you know, it was a hasty one so so the definition problems
You know, it was a hasty run. So the definition problems quickly get. And I think that was the, that was the insight of Andrew Eng.
When he said that job of the product managers to understand these things that
at algorithms don't, and then define what that looks like.
And then you have something to train towards, right?
Then you have kind of the test set.
And then so today, the editors create this pool of tracks,
and then we personalized, you could easily imagine that once you have this set, you could have some automatic exploration
on the rest of the catalog, because then you understand what it is.
And then the other side of it when machine learning does help is this taste of vector.
How hard is it to construct a vector that represents the things an individual human likes. The human preference. So you can, you know, music isn't like,
it's not like Amazon, like things you usually buy.
Music seems more amorphous, like it's this thing that's hard to specify.
Like, what is, you know, if you look at my playlist,
what is the music that I love? It's harder, it seems to be much more difficult
to specify concretely.
So how hard is it to build a taste vector?
It is very hard in the sense that you need a lot of data.
And I think what we found was that,
so it's not a stationary problem,
it changes over time.
And so we've gone through the journey of,
if you've done a lot of computer vision,
obviously I've done a bunch of computer vision in my past.
And we started with the handcrafted heuristics.
This is kind of in the music.
This is this, and if you consume this, you probably like this.
So we have, we started there,
and we have some of that still.
Then what was interesting about the playlist data
was that you could find these latent things
that wouldn't necessarily even make sense to you,
that could even capture maybe cultural references
because they co-occurred things
that wouldn't have appeared mechanistically,
either in the content or so forth.
So I think that, I think the core assumption is that there
are patterns in almost everything.
And if there are patterns, these embedding techniques
are getting better and better now.
Now, as everyone else, we're also using deep embeddings where you can encode binary values
and so forth.
And what I think is interesting is this process to try to find things that do not necessarily,
you wouldn't actually have guessed.
So it is very hard in an engineering sense
to find the right dimensions.
It's an incredible scalability problem
to do for hundreds of millions of users
and to update it every day.
But in theory, in theory embeddings
isn't that complicated?
The fact that you try to find some principal components
or some like that, dimensionality reduction and so forth. So the theory, I guess, is that the
practice is very, very hard. And it's a huge engineering challenge. But fortunately, we
have some amazing both research and engineering teams in this space.
Yeah, I guess the question is all, I mean, it deal with the autonomous vehicle space is the question is how hard is driving and
here is
basically the question is of edge cases
So embedding probably works
Not probably but I would imagine works well in a lot of cases
So there's a bunch of questions that arise then.
So do song preferences?
Does your taste vector depend on context, like mood?
Right?
So there's different moods.
And then absolutely.
So how does that take in a, is it possible to take that
into consideration
or do you just leave that as an interface problem
that allows the users to just control it?
So when I'm looking for workout,
music I kind of specify it by choosing certain players
doing certain search.
Yeah, so that's a great point.
And it's back to the product development.
You could try to spend a few years trying to predict
which mood you're in automatically
when you open Spotify or you create a tab which is happy and sad, right? And you're going to be
right 100% of the time with one click. Now, it's probably much better to let the user tell you
if they're happier as that, if they want to work out. On the other hand, if you're used to
interface become 2000 tabs, you're introducing so much friction so no one will use the product,
so then you have to get better. So it's this thing where I think it maybe was, I remember who coined it, but it's called
Fall Tolerant DUIs, right?
You build a UI that is tolerant to being wrong.
And then you can be much less right in your algorithms.
So we've had to learn a lot of that.
Building the right UI that fits where the machine learning is.
And a great discovery there, which was by the teams during one of our hack days,
was this thing of taking discovery, packaging it into a playlist and saying that
these are new tracks that we think you might like based on this.
And setting the right expectation made it a great product.
So I think we have this benefit that, for example,
Tesla doesn't have that, we can change the expectation.
We can build a full tolerance setting.
It's very hard to be full tolerance
when you're driving at a 100 miles per hour or something.
And we have the luxury of being able to say that
of being wrong if we have the right UI which gives us
Different abilities to take more risk. So I actually think the self-driving problem is much harder. Oh, yeah, for sure
It's much less fun because
People die exactly and since Spotify
It's such a more fun problem because failure will, I mean,
failure is beautiful in a way, at least exploration. So it's a really fun reinforcement learning
problem. The worst case scenario is to get these WTF tweets. How did I get this song?
Which is a lot better than the self-driving phase. Exactly. So what's the feedback that a user,
what's the signal that a user provides into the system?
So the, you mentioned skipping.
What is like the strongest signal is,
you didn't mention clicking like.
So, we have a few signals that are important.
Obviously playing, playing through.
So one of the benefits of music, actually, even compared to podcasts or movies is the
object itself is really only about three minutes.
So you get a lot of chances to recommend.
And the feedback loop is every three minutes instead of every two hours or something.
So you actually get kind of noisy, but quite fast feedback.
And so you can see if people played through,
or if they're just the inverse of skip really.
That's an important signal.
On the other hand, much of the consumption
happens when your phone is in your pocket,
maybe you're running or driving or you're playing on a speaker.
And so you're not skipping.
It doesn't mean that you love that song.
It might be that it wasn't bad enough
that you would walk up and skip.
So it's a noise to signal.
Then we have the equivalent of the like, which is you save it to your library.
That's a pretty strong signal of affection.
And then we have the more explicit signal of play listing.
Like you took the time to create a playlist, you put it in there.
There's a very little small chance that if you took all that trouble,
this is not a really
important track to you. And then we understand also what other tracks it relates to. So we have
we have the play listing, we have the like, and then we have the listening or skipped. And you have
to have very different approaches to all of them because at different levels of noise, one is very
voluminous, but noisy and the other is rare, but you can probably
trust it. Yeah, it's interesting, because I think between those signals captures all the
information you'd want to capture. I mean, there's a feeling, a shallow feeling for me that
there's sometimes I'll hear a song and there's like, yes, this is, you know, this was the
right song for the moment, but there's really no way to express that fact,
except by listening through it all the way.
Yeah, and maybe playing it again at that time or something.
But there's no need for a button that says,
this was the best song I could have heard at this moment.
Well, we're playing around with that,
with kind of the thumbs up concept,
saying like, I really like this,
just kind of talking to the algorithm.
It's unclear if that's the best way for humans to interact.
Maybe it is.
Maybe they should think of Spotify as a person and agent sitting there trying to serve
you and you can say like, that's Spotify, good Spotify.
Right now the analogy we've had is more you shouldn't think of of us, we should be
invisible.
And the feedback is if you save it, it's kind of you work for yourself.
You do a playlist because you think it, you work for yourself, you do a playlist
because you think it's great,
and we can learn from that.
It's kind of back to Tesla,
how they kind of have this shadow mode,
they sit in what you drive,
we kind of took the same analogy,
we sit in what you play list,
and then maybe we can,
we can offer you an auto pilot,
we can take over for a while or something like that,
and then back off if you say,
that's not good enough,
but I think it's interesting to figure out what your mental model is.
If Spotify is an AI that you talk to, which I think might be a bit too abstract for many
consumers, or if you still think of it as it's my music app, but it's just more helpful.
And depends on the device that's running on, which brings us to smart
speakers. So I have a lot of the Spotify listening I do is on things that
I'm device that can talk to whether it's from Amazon Google or Apple. What's the
role of Spotify and those devices? How do you think of it differently than on the
phone or on the desktop? There are a few things to say about.
But first of all, it's incredibly exciting.
They're growing like crazy, especially here in the US.
And it's solving a consummate that I think is,
is you can think of it as just remote interactivity.
You can control listening from across the room.
And it may feel like a small thing,
but it turns out that friction matters to consumers.
Being able to say play, pause, and so forth
from across the room is very powerful.
So basically you made the living room interactive now.
And what we see in our data is that the number one
use case for these speakers is music, music and podcast.
So fortunately for us, it's been important to these companies
to have those use case coverage.
So they want its Spotify on there,
so we have very good relationships with them.
And we're seeing tremendous success with them.
What I think it's interesting about them is,
it's already working.
We kind of had this at Piffene many years ago
back when we started using Sonos.
If you went through all the trouble
of setting up your Sonos system,
you had this magical experience where you had all the music ever made in your living room. And
we made this assumption that the home, everyone used to have a CD player at home, but they
never managed to get their files working in the home. Having this network attached storage
was too cumbersome for most consumers. So we made the assumption that the home would
skip from the CD all the way to streaming books where you would buy the steering wheel without all the music built in.
That took longer than we thought, but with the voice speakers, that was the unlocking
that made kind of the connected speaker happen in the home.
So it really exploded and we saw this engagement that we predicted would happen.
What I think is interesting though is where it's going from now.
Right now, you think of them as voice speakers,
but I think if you look at Google I-O, for example,
they just added a camera to it where, you know,
when the alarm goes off, instead of saying,
hey Google stop, you can just wave your hand.
So I think they're going to think more of it as an agent,
or as an assistant, truly an assistant,
and an assistant that can see you.
It's going to be more effective than a blind assistant.
So I think these things will morph,
and we won't necessarily think of them as, quote-unquote,
voice speakers anymore, just as interactive access
to the internet in the home.
But I still think that the biggest use case for those will be audio.
So for that reason, we're investing heavily in it.
And we built our own NLU stack to be able to...
The challenge here is, how do you innovate in that world?
It's a lower risk for consumers, but it's also much more constrained.
There, you have no pixels to play with in an, in an audio only world.
It's really the vocabulary that is the interface.
So we started investing and playing around quite a lot with that,
trying to understand what the future will be of you speaking and gesturing and waving at your music.
And actually, you're actually nudging closer to the autonomous vehicle space,
because from everything I've seen, the level of frustration people experience upon failure
of natural language understanding is much higher than failure in other contexts. People
get frustrated really fast. So if you screw that experience up even just a little bit,
they give up really quickly. Yeah. And I think you see that in the data.
While it's tremendously successful,
the most common interactions are play, pause, and you know, next.
The things where if you compare it to taking up your phone, unlocking it,
bringing up the app and skipping, clicking skip, it was, it was much lower friction.
But then for longer, more complicated things, like can you
find me in that zone, people still bring up the phone and search and then play it on their
speaker.
So we try again to build a fault tolerant UI, where for the more complicated things, you
can still pick up your phone, have powerful, full keyboard search, and then try to optimize
for where there is actually lower friction and try to, it's kind of like the test autopilot thing. You have to be at the level where you're helpful. If you're too smart
and just in the way, people are going to get frustrated.
And first of all, I'm not obsessed with stairway to happen. It's just, it's all, but let
me mention that as a use case because it's an interesting one. I've literally told one
of, I don't want to say the name of the speaker because it'll, when people are listening to it, it'll make their speaker go off. But I talk to
the speaker and I say, play stairway to heaven. And every time, it, like, not every time,
but a large percentage of time plays the wrong stairway to heaven. It plays like some cover
of the, and that part of the experience, I actually wonder from a business perspective,
the Spotify control that entire experience or no.
It seems like the NLU, the natural language stuff, is controlled by the speaker, and then
Spotify stays at a layer below that.
It's a good and complicated question. Some of which is dependent on the
partner, so it's hard to comment on the specifics, but the question is the right one.
The challenge is if you can't use any other personalization, I mean, we know which
stairway to have and the truth is, maybe for one person, it is exactly the cover
that they want and they'd be very frustrated if a place,
I think we default to the right version,
but you actually want to be able to do the cover
for the person that just played the cover 50 times
or Spotify is just going to seem stupid.
So you want to be able to leverage the personalization,
but you have this stack where you have the ASR
and this thing called the NBest List,
or the NBest Guest List, here, and then the personization comes in at the NBEST list or the NBEST guesses here.
And then the presentation comes in at the end.
You actually want the presentation to be here
when you're guessing about what they actually meant.
So we're working with these partners.
And it's a complicated thing where you want to,
you want to be able, so first of all,
you want to be very careful with your users data.
You don't want to share your users data without the permission,
but you want to share some data so that their experience gets better
so that these partners can understand enough, but not too much and so forth.
So it's really the trick is that it's like a business-driven relationship
where you're doing product development across companies together,
which is really complicated.
But this is exactly why we built our own
NLU so that we actually can make personalized guesses because this is the
biggest frustration. From a user point of view, they don't understand about ASRs
and NBEST lists and business deals. They're like, how hard can it be? I've told
this thing 50 times, this version, and still plays the wrong thing. It can't be
hard. So we try to take that user approach.
If the user, the user's not going to understand
the complications of business, we have to solve it.
Let's talk about sort of a complicated subject
that I myself am quite torn about.
The idea of sort of of paying artists, right?
sort of of paying artists, right? I saw as of August 31, 2018, over $11 billion will pay to rights holders. So, and further distribute to artists from Spotify. So a lot of money is
being paid to artists. First of all, the whole time is a consumer for me when I look at Spotify, I'm not sure,
I'm remembering correctly, but I think you said exactly how I feel, which is, this is too good
to be true. Like, when I start using Spotify, I assume you guys will go bankrupt in like a month.
It's like, this is too good. A lot of people did.
This is too good. A lot of people did.
This is amazing.
So one question I have is sort of the bigger question, how do you make money in this complicated
world?
How do you deal with a relationship with record labels who are complicated, these big, you're essentially have the task of hurting cats, but like rich
and powerful cats.
And also have the task of paying artists enough and paying those labels enough and still
making money in the internet space where people are not willing to pay hundreds of dollars
a month.
So how do you navigate
the space? That's a beautiful description. Hearding rich cats.
I've never heard that before. It is very complicated and I think, yeah, certainly, actually betting
against Spotify has been statistically a very smart thing to do, just looking at the line of roadkill in music streaming services.
It's kind of, I think, if I understood the complexity when I joined Spotify, fortunately,
I didn't know enough about the music industry to understand the complexities,
because then I would have made a more rational guess that it wouldn't work.
Ignorance is bliss.
I would have made a more rational guess that it wouldn't work. So, you know, ignorance is bliss.
But I think there have been a few distinct challenges.
I think, as I said, one of the things that made it work at all was that Sweden and the
Nordics was a lost market.
So there was no risk for labels to try this.
I don't think it would have worked if the market was healthy.
So that was the initial condition.
Then we had this tremendous challenge with the modlet itself.
Now, most people were pirating, but for the people who bought a download or a CD,
the artist would get all the revenue for all the future plays then. So you got it all up front, whereas the streaming model was like almost nothing they won, almost nothing they too.
And then at some point, this curve of incremental revenue would intersect with your day one payment.
And that took a long time to play out before the music labels, they understood that,
but on the artist side, it took a lot of time to understand that actually,
if I have a big hit that is going to be played for many years, this is a much better model,
because I get paid based on how much people use the product,
not how much they thought they would use it, they won or so forth.
So it was a complicated model to get across, and but time helped with that, right?
And now the revenues to the music industry
actually are bigger again than, you know,
it's going through this incredible dip
and now they're back up.
And so we're very, we're very proud
of having been a part of that.
So there have been distinct problems.
I think when it comes to the labels,
we have taken the painful approach.
Some of our competition at the time, they kind of looked at other companies and said,
if we just ignore the rights, we get really big, really fast.
We're going to be too big for the labels to kind of too big to fail, they're not going to kill us.
We didn't take that approach. We went legal from day one.
And we negotiated and negotiated and negotiated. And negotiated was very slow, it was very frustrating. We were angry at seeing other
companies taking shortcuts and seeming to get away with it. It was this this
game theory thing where over many rounds of playing the game, this would be the
right strategy. And even though clearly there's a lot of frustrations at times
during renegotiations, there is this weird trust where we have been honest
I'm fair, we've never screwed them, they've never screwed us, it's ten years, but there's this trust
in like they know that if music doesn't get really big, if lots of people do not want to listen to music,
I want to pay for it, Spotify has no business model. So we actually are incredibly aligned, right? Other companies, not to be tensed, but other companies have other business
models, where even if they may need music from, no money for music, there still be profitable
companies, but Spotify won. So I think the industry sees that we are actually aligned business-wise.
So there is this trust that allows us to do product development, even if it's scary,
taking risks. The free model itself was an incredible risk for the music industry to take,
that they should get credit for. Now some of it was that they had nothing to lose in Sweden,
but frankly, a lot of the labels also took risk. And so I think we built up that trust with the,
I think, hurting or cats sounds a bit, what's the word?
It sounds like, just miss of the cat.
Dismiss it.
No, every cat matters.
They're all beautiful and very important.
Exactly.
They've taken a lot of risks.
And certainly it's been frustrating on both sides.
It's done a lot of good.
Yeah.
So it's really like playing, it's game theory.
If you play the, if you play
the game many times, then you can have the statistical outcome that you bet on. And it
feels very painful when you're in the middle of that thing. I mean, there's risk. There's
trust, there's relationships from just having read the biography of Steve Jobs, similar
kind of relationships were discussed in iTunes. The idea of selling
a song for a doll was very uncomfortable for labels. And there was no, it was the same kind
of thing. It was trust. It was game theory. It has a lot of relationships that had to be
built. And it's really a terrifyingly difficult process that Apple could go through a little bit because they could
afford for that process to fail for Spotify. It seems terrifying because you can't.
Initially, I think a lot of it comes down to, you know, honestly, Daniel and his tenacity in
negotiating, which seems like an impossible to siphon task, because he was completely unknown and so forth.
But maybe that was also the reason that it worked.
But I think, yeah, I think Game 3 is probably the best way
to think about it.
You could go straight for this like Nash equilibrium
that someone is going to defect, or you played many times you try to actually go for the top left
the corporations sell. Is there any magical reason why Spotify seems to have
won this so a lot of people have tried to do a Spotify try to do and Spotify's
come out well so the answer is that there's no magical reason because I don't believe in magic.
But I think there are reasons.
And I think some of them are that people
have misunderstood a lot of what we actually do.
The actual Spotify model is very complicated.
They've looked at the premium model and said,
it seems like you can charge $9.99 for music and people are going to pay.
But that's not what happened.
Actually, when we launched the original mobile product, everyone said they would never
pay.
What happened was they started on the free product and then their engagement grew so much that
eventually they said, maybe it is worth 999, right?
It's your propensity to pay gross with your engagement.
So we have this super complicated business model.
We operate two different business models,
advertising and premium at the same time.
And I think that is hard to replicate.
I struggle to think of other companies
that run large scale advertising
and subscription products at the same time.
So I think the business model is actually
much more complicated than people think it is.
And so some people went after just the premium part
without the free part and ran into a wall where no one
wanted to pay.
Some people went after just music should be free, just ads,
which doesn't give you enough revenue,
and doesn't work for the music industry.
So I think that combination is kind of opaque from the outside. So maybe
I shouldn't say it here and reveal the secret, but that turns out to be hard to replicate
than you would think. So there's a lot of brilliant business strategy here. Brillions are
luck, probably more luck, but it doesn't really matter. It looks brilliant in retrospect.
So let's call it brilliant. Yeah, when the books are written, they'll be brilliant.
You've mentioned that your philosophies
to embrace change.
So how will the music streaming and music listening world
change over the next 10 years, 20 years?
You look out into the far future.
What do you think?
I think that music and for that matter, audio, podcast, audio books, I think it's one of the few core human needs. I think it, there is no good reason to me why it shouldn't
be at the scale of something like messaging or social networking. I don't think it's a
niche thing to listen to music or news or something. So I think scale is obviously one of the things that I really hope for.
I think I hope that it's going to be billions of users.
I hope eventually everyone in the world gets access to all the world's music I've ever made.
So obviously I think it's going to be a much bigger business.
Otherwise we wouldn't be betting this big.
Now if you look more at how it is consumed,
what I'm hoping is back to this analogy
of the software tool chain,
where I think I sometimes internally,
I make this analogy to text messaging.
Text messaging was also based on standards
in the area of mobile carriers.
You had the SMS, the one under the 40 character, the one in the SMS.
And it was great because everyone agreed on the standard.
So as a consumer, you got a lot of distributions and interoperability,
but it was a very constrained format.
And when the industry wanted to add pictures to that format to do the MMS,
I looked it up and I think it took from the late 80s to early 2000.
This is like a 15, 20 year product cycle to bring pictures into that.
Now, once that entire value chain of creation and consumption
got wrapped in one software stack within something like Snapchat or WhatsApp.
Like the first week they added disappearing messages.
Like then two weeks, they added stories.
The pace of innovation when you're on one software stack, you can affect both creation and
consumption, I think it's going to be rapid. With these streaming services, we now, for the
first time in history, have enough, I hope, people on one of these services. Actually, whether
it's Spotify or Amazon or Apple or YouTube,
and hopefully enough creators that you can actually start working with the format again.
And that excites me. I think being able to change these constraints from 100 years,
that could really do something interesting. I don't, I really hope is not just going to be there.
Itteration on the same thing for the next 10 to 20 years as well.
Yeah, changing the creation of music,
the creation of audio, creation of podcasts,
is a really fascinating possibility.
I myself don't understand what it is about podcasts
that's so intimate, it's just it is.
I listen to a lot of podcasts.
I think it touches on a deep human need for connection.
The people do feel like they're connected to, when they listen.
I don't understand what the psychology of that is, but in this world is becoming more
and more disconnected.
It feels like this is fulfilling a certain kind of need.
Empowering the creator as opposed to just the listener is really interesting.
I'm really excited that you're working on this.
Yeah, I think one of the things that is inspiring for our teams to work on both the guests is
exactly that.
Whether you think like I probably do that it's something biological about perceiving
to be in the middle of the conversation that makes you listen in a different way.
It doesn't really matter.
People seem to perceive it differently.
And there was this narrative for a long time that, you know, if you look at video, everything
kind of in the foreground, I got shorter and shorter and shorter because of financial
pressures and monetization and so forth.
And eventually, at the end, there's almost like 20 seconds clip, people just screaming
something.
And I'm really, I feel really good about the fact that you could have interpreted that as people have
no attention span anymore.
They don't want to listen to things.
They're not interested in deeper stories.
People are getting dumber.
But then podcasts came along and it's almost like no, no, the need still existed.
But maybe it was the fact that you're not prepared
to look at your phone like this for two hours,
but if you can drive it the same time,
it seems like people really want to dig deeper
and they want to hear like the more complicated version.
So to me, that is very inspiring
that that podcast is actually long form.
It gives me a lot of hope for humanity
that people seem really interested in hearing
deeper, more complicated conversations.
This is a, I don't understand it.
It's fascinating.
It's a, the majority for this podcast, listen to the whole thing.
This whole conversation we've been talking for an hour and 45 minutes and somebody will,
I mean, most people will be listening to these words I'm speaking right now.
It's great.
Yeah, you wouldn't have thought that 10 years ago, where the world seemed to go.
So that's very positive, I think.
That's really exciting and empowering the creator.
And there's as really exciting.
Last question.
You also have a passion for just mobile in general.
How do you see the smartphone world, the digital space of smartphones and just everything
that's on the move, whether it's Internet of Things and so on, changing over the next
10 years and so on?
I think that one way to think about it is that computing might be moving out of these multipurpose devices,
the computer we had in the phone,
into specific purpose devices.
And it will be ambient, that at least in my home,
you just shout something at someone.
And there is always one of these speakers close enough.
And so you start behaving differently.
It's as if you have the internet ambiently around you
and you can ask it things.
So I think computing will kind of get more integrated.
And we won't necessarily think of it
as connected to a device and the same thing
in the same way that we do today.
I don't know the path to that maybe we used to have these desktop computers and then we partially
replaced that with the laptops and left the you know, we had desktop at home and our work and then
we got these phones and we started leaving the laptop at home for a while and maybe the
maybe for stretches of time you're going to start using the watch and you can leave your
your phone at home like for a runner something and you'll
We're on this progressive path where
You I think what what is happening with the voice is that
You haven't you have an interactive interaction paradigm that doesn't
require as large physical devices. So I definitely think there's a future where you can have
your airports and your watch.
And you can do a lot of computing.
And I don't think it's going to be this binary thing.
I think it's going to be like Manoevo still have a laptop.
We just use it less.
And so you shift your consumption over.
And I don't know about AR glasses of so forth.
I'm excited about it.
I spent a lot of time in that area,
but I still think it's quite far away.
AR VR.
All of this.
VR is happening and working.
I think the recent Oculus Quest is quite impressive.
I think AR is further away, at least that type of AR.
I think, but I do think your phone or watch your glasses
understanding where you are and maybe what you're looking at
and being able to give you audio cues about it,
or you can say like, what is this?
And it tells you what it is.
That I think might happen, you know,
you use your watch or your glasses as a mouse pointer
on reality.
I think it might be a while before, it might be wrong. I hope on reality. I think it might be a while before,
I might be wrong. I hope I'm wrong. I think it might be a while before we walk around with these
big like lab glasses that project things. I agree with you. There's a, it's actually really difficult
when you have to understand the physical world enough to project onto it. Well, I lied about the last question, because I just thought of audio and my favorite
topic, which is the movie her. Do you think, well, there's part of Spotify or not, we'll
have, I don't know if you've seen the movie her. Absolutely. And there audio is the primary form of interaction and the connection with another entity that
you can actually have a relationship with it.
You fall in love with based on voice alone, audio alone.
How far do you think that's possible, first of all, based on audio alone to fall in love
with somebody?
Somebody or, well, yeah, let's go with somebody. Just have a relationship based on audio alone.
And second question to that,
can we create an artificial intelligence system
that allows one to fall in love with it
and her him with you?
So there's my personal answer.
Ah.
Speaking for me as a person,
the answer is quite unequivocally yes, personal answer. Ah. Speaking for me as a person, the answer is quite unequivocally,
yes, on both.
I think what we just said about podcasts and the feeling
of being in the middle of a conversation,
if you could have an assistant where,
and we just said that feels like a very personal setting.
So if you walk around with these headphones
and this thing, you're speaking with this thing all
of the time, that feels like it's in your brain. I think it's it's going to be much easier to fall
in love with than something that would be on your screen. It can be entirely possible. And then from
the you can probably answer is better than me, but from the concept of if it's going to be possible to
build a machine that they can achieve that, I think whether think whether you think of it as if you
can fake it, the philosophical zombie that assimilates it enough or it somehow
actually is, I think there's, it's only question, if you asked me about time, I'd
have to be a financier, but if you say a given some half-infinite time,
absolutely, I think it's just atoms and arrangement of information.
Well, I personally think that love is a lot simpler than people think.
So we started with true romance and ended in love.
I don't see a better place to end.
Beautiful.
Gustav, thanks so much for talking today.
Thank you so much. It was a lot of fun.
It was fun. on.