a16z Podcast - Personal Genomics: Where Are We, Really?
Episode Date: January 6, 2020This is a turn of the decade (and January-themed) look backward/ look forward into personal genomics, given recent and past retrospective and prospective pieces in the media on the promise, and perils..., of the ability to sequence one's DNA: What did it, and does it, mean for personalized medicine, criminal investigations, privacy, and more?General partner Jorge Conde, who has a long history in the space, covers everything from where genealogy databases and large datasets come in to fetal testing, multi-omics, and other themes spanning the past, present, and future of personal genomics in conversation with Sonal Chokshi for episode #18 our news show 16 Minutes, where we cover recent headlines, the a16z way, from our vantage point in tech -- and especially what's hype/ what's real. While we typically cover multiple headlines, this is one of our special deep-dive episodes on a single topic. (You catch up on other such deep dives, on the opioid crisis and other evergreen episodes, at a16z.com/16Minutes). And if you haven't already, be sure to subscribe to the separate feed for "16 Minutes" to continue getting new episodes. image: Petra Fritz / Flickr
Transcript
Discussion (0)
Hi everyone. Happy New Year. I'm Sonal. As you may know, we launched a new short form new show last year, 16 minutes, where we cover recent news the A6 and Z podcast way. What's hype, what's real, why they matter from a vantage point in tech. And that show has continued in a separate feed for quite some time now. You can subscribe to it if you haven't already in your podcast app by searching for 16 minutes A6 and Z. But I'm also sharing the latest episode here in this show feed, since we sometimes cover not just multiple news items, but a single topic prompted by recent
headlines like we did on our episodes on e-sports and the opioid crisis.
This week, the topic is personal genomics, the promise, the perils, where are we really today,
and where could we be going next?
We start with an article by Peter Aldhaus, unquote, 10 years ago, DNA tests were the future
of medicine.
Now there is social network and a data privacy mess.
The article refers to a series of events, everything from companies like 23 and me and the FDA,
to some of the headlines we've seen lately around criminals being caught based on their
relatives' DNA. There's also a number of companies cited in the article who offer such tests.
To be clear, none of the following discussions that should be taken as investment advice.
Please see a6.com slash disclosures for important information. So that's a context. And super
quick summary. Now, let me introduce our A6 and Z expert, general partner Jorge Condé,
who has a long history in this area. Since it's a turn of a decade and the first episode of
January, I thought it'd be great for us to do sort of a Janus-themed look-back, look forward,
starting with quick reactions on reading the piece.
Well, when I read the BuzzFeed piece, which was super interesting, it took me back to a very
specific moment in time.
And I was living in this world.
I was in the personal genomic space.
I had just started a startup that was looking to essentially interpret full genome data at
scale.
If there was something in DNA that could be found to be relevant or actionable, we were building
the technology to detect that.
But what I thought was really neat is I'm reading this 10-year retrospective.
If I go back to that moment in time, I actually participate.
in a piece that was in some way a 10-year prospective look on what the future of personal
genomics would look like. And this is in the 2008 time frame, more or less, I get an outreach
from, of all things, GQ magazine. They had an author, a guy by the name of Richard Powers, who had
just written a book, had won all kinds of awards. He wanted to write about the experience of what
it would mean to have his full genome sequenced and essentially revealed to him. And we had
started this company gnome with the idea that we would be among the first to fully sequence
individuals and interpret their DNA for them. But what's really interesting is if you almost
read that piece as a companion to this backward looking look, you get the forward looking look
of what the next 10 years in personal genomics would look like. What was it called? It was called
The Book of Me. Oh, fantastic. What a great title. So then what is your take? What's hype?
What's real here when it comes to the promise of personal genomics? The whole complaint of this
article is that we were promised one thing. They were supposed to be the future of medicine,
but hey, instead we got this big data privacy mess.
So looking 10 years back of what was hype,
or at least over-expectation,
was that people in general would have a deep curiosity
to understand their DNA.
You're saying that part is hype?
I would think that part is reality.
Ah, well, what's really interesting is if you look at several companies named,
I think all had at some level an idea
that there would be a large number of people
that wanted to very deeply understand any sort of secrets or actionable insights that you could draw
from your own genomic information.
And while those people definitely exist, I don't think that a large market materialized around
those people.
In fact, one of the eye-opening things for me when I was starting my company back in 2008,
Ancestry.com was primarily selling subscription services for getting into these sort of Ancestry
databases.
Yeah, online family trees.
Yes.
So I remember I downloaded the S1, Ancestry.com's subscription.
subscription revenue with something on the order of $200 million that year.
So the question is, do people fundamentally want to understand their DNA in terms of health,
risks, and the like, or do people have a fundamental curiosity to know who they are and where they
come from?
Oh, that's where you're saying the difference between what's the actual market for this kind of,
there's a curiosity, but not necessarily a market for DNA around it.
Exactly.
So people want to understand who they are and where they come from.
And if that happens to come from DNA data, great.
If that happens to come from looking at ancestry databases,
that seems to be a pretty reasonable substitute
for getting that insight.
So, okay, so you're saying one of the things that's hype
is that people may not necessarily want DNA data specifically.
What else is hype?
I think one of the other things that was potentially hype,
certainly at that time in 2008, 2008, 2009, 2010 time frame
is that there would be something deeply concrete
about DNA that would determine
what your potential health risks
and therefore what your potential outcomes might look.
like. You know, sort of it's this idea that DNA is destiny when it comes to your health. Now,
that's certainly true in some subset of diseases. The subset of diseases that are known to be
monogenic. Right. So single factorial driving it. Exactly. When there is a mutation in a gene
that results in a specific condition like a sickle cell anemia. Right, which we talked about in our
CRISPR episode. But when you start to look at things that are much more complex, much more
multifactorial. Like cancer, many other diseases. Cancer, metabolic disorders, you know, pick any number
of cardiovascular risks. There are certainly genetic contributors, but as a lot of experts in the
field say, is you probably get that same level of information from getting a good family history.
Right. So basically, the second hype piece you're saying is that it is not a direct link,
a map from, oh, here's your DNA, and then, oh, here's all the diseases you're going to get,
not get, et cetera. And here's the precise risk you have for this disease based on me analyzing your
DNA. So I think that's probably an area where expectations were probably higher than where we
were in reality in terms of how actionable is this information for someone that is seeking to manage
their health. So that's maybe one of the things where maybe the promise hasn't quite come through
yet. And I think another area that is really interesting is a lot of these businesses were
conceived as subscription businesses, where I would give someone a DNA kit for Christmas, they would
get their genome scan, and then they would engage with that on some regular basis. And I would
suspect that the vast majority of people that had those DNA scans done, oh, so many
Christmases ago, probably haven't logged in in a while. So if you had a genome scan done
in 2009 and you did another genome scan in 2019, I can almost guarantee you that your
ancestral makeup would look different over the course of those 10 years. Simply because of the
available data. We just know more. That's right. You, of course, haven't changed who you are,
but who an ancestry map tells you are has changed. Okay. So that's where
maybe things were hyped or not delivered yet or promised and didn't quite come through.
Now let's quickly talk about the reality. So where are we today? What is possible right now truly
with personal genomics? Well, the first thing I would say is genomics more broadly has delivered
a lot over the course of the last 10 years. In fact, I will say this as an expert, not as an
entrepreneur in the genomic space, but as a parent of many children. One of the fascinating
things that I saw was the time my wife was pregnant with her oldest child,
you still could not get enough of a signal from maternal blood
as to whether or not there was sufficient fetal DNA in circulation
to determine whether or not there were genetic abnormalities.
By the time we had our last child, that was routine, standard of care.
The other example is, you know, when the child is actually born,
the mandated genetic tests when a child is born by a state,
some of the ones you could also opt into,
that menu of tests that were available multiplied in the relatively few years
between the time when we had our first child and our last child.
So roughly a decade span.
So we've seen a lot of advance just in the use of genetic information
and the practice of medicine.
And that's a remarkable advance forward.
So now let's focus on personal genomics specifically.
One of the promises of personal genomics,
even back in 2009, was predicated on the fact
that there would be power in numbers.
And in large part, that's why some of the leaders in this space,
whether it's 23 and me or Ancestry.com,
that eventually came into this,
there's so much value in them amassing a large database.
Because in some ways, as you have more samples in a database,
you get better reads on who we are, just genealogic.
You have a higher resolution map of the world.
Now, the risk of having a large aggregated data set,
it also becomes attempting target,
sometimes for legitimate uses, for investigation,
sometimes perhaps for illegitimate uses.
Right, this is where privacy concerns come in, exactly.
Very interestingly enough,
this privacy question, it sounds very,
futuristic, but even in 2009, these concerns were very real. If you read the terms and conditions
that services had to their credit, they were very explicit that this information, you know, could
be used in unintended ways. Oh, in fact, the article even points out that one of the companies
had to actually expand their definition of a violent crime in order to cover it in their terms
and services. And secondly, that some of them are actually moving to opting in, to whether you can
even be included in that aspect of that database, which is also fascinating. People can actually
choose. Well, in Nome, we made a decision early on where we said we're actually not going to
aggregate all of the data. We're not going to centralize it. We did something inverse. What we
decided to do was we would sequence an individual and place that sequence that genomic data
on an encrypted key that would live in a decentralized network. And the thought was you could keep
the queries centralized. So let's say a researcher wanted to understand how many people in a population
and have this mutation associated with this disease,
you would push the queries down to the edges of the network,
the analysis would run locally,
the result, and only the result would come back, get centralized,
then you'd have an aggregated answer to that question.
That's fascinating.
Funnally, even though it's a very different example,
reminds me of differential privacy.
And that was also something that, you know,
Apple made a bigger deal about in the last few years,
but in fact was based on a paper from Microsoft researchers
like a decade ago.
It's a fundamental insight they have for how to separate these two things.
So it's kind of funny, the synchrony of all that.
Yeah, and arguably we were 10 years too early.
That's always about timing.
The other thing is when we thought through these questions of privacy,
a lot of these were perceived risks.
We didn't know, but we wrote a lot of risk factors out to getting yourself sequenced.
And among them, we had things that sound fantastical.
Like if someone had an entire readout of your genome,
they could essentially synthesize your genome
and then plant your DNA at a crime scene.
Right. Fascinating.
Right.
And all of a sudden you have these genetic fingerprints,
a place where you've never been.
My co-founder, George Church,
who's a professor genetics at Harvard,
medical school, you know, he insisted that we include, if someone were getting sequenced,
we couldn't ask them to get buy-in from all of their family members, but we could require that
if any of them had a twin, an identical twin, that that twin would also have to sign up.
Of course. That makes perfect sense. So, okay, was there anything else on what is possible
right now on the personal genomics front? Oh, yeah. So in the present, you could argue that on the
ancestry side, we're getting much better at sort of getting a high-resolution view of who we are
and where we come from and all of that. And it's an end of one example, but if you take the case
of 23 and me, and over the course of a decade has amassed a large enough genomic data set
that it's clearly valuable from a research and development standpoint. It wasn't that long ago
when they announced a collaboration with GlaxoSmithK with GSC, where GSC is essentially
paying them something on the order of $300 million to get access to this data set to be able
to drive some insights from it and potentially even follow up with people on a very opt-in basis.
And so, you know, that will show you that at least on the original promise of person in genomics,
that this is one example.
That's been delivered, right?
And so there is power in numbers.
And I think the question like with any other technology, with any other resources, can we find the right balance where we're benefiting the comments and not at the expense of the individual?
And I think that's where a lot of the debate happens in terms of are we doing the right things.
So that's where we are now.
Let's talk about the future, since we're doing this whole Janus-themed episode.
So given that there was this past of promise that was and wasn't delivered,
present of where we are, where are we going with personal genomics next? Or what is actually
possible based on what we already know today? Yeah, well, I think there are certain things that
are possible based on what we know today. The first one is, as these data sets become more
rich, the ability to derive insights from them, that'll be relevant for how we diagnose or treat
disease, I think that becomes increasingly more valuable over time. So what I mean by that is
one of the big knocks on drug discovery and development is that it takes a long time, it's very
expensive and the risk of failure is high. One of the sort of lesser known data points is that
if you have a genetic insight driving the program saying that I think that a particular molecule
or compound is going to be effective in a particular patient population that's defined by some
sort of genetic or genomic marker, that molecule, that compound, that drug has a much higher,
significantly higher chance of success. What does that mean practically? Does it mean that we can
actually basically, is it natural extrapolation of that, that there may be a future where we do
get personalized tailored medicine based on those molecules?
That's right.
The extrapolation of that is that we'll get better, faster, cheaper drugs that are tailored to
the right population.
So they're like personalized cocktails at a mass manufactured level.
Essentially, yeah, personalized cocktails of therapies that at least are targeted to specific
populations.
That's actually the better way of saying that.
Another potential future thing, especially we're doing a 10-year perspective look from
today, is people talk about personal genomics.
Genomics is but one omic.
Ah, yes. Multimics.
That's right.
Very big thing.
If the big revolution over the course of the last 10 years is that we were able to sequence,
you know, read DNA at a massive scale at a low cost, at high fidelity and all those things,
that's increasingly true across many other ways in which biology transmits information.
You know, and I can bore you with all of the omics, but...
Go through a couple of the hit list.
I mean, proteomics is one I know from when I was at park.
So genomics is DNA, proteomics is protein, transcriptomics is all.
RNA, epigenomics is gene regulation and how genes levels are set, metabolomics, the set
of metabolites in your body, and of course microbiomics, how the microbiome influences with all
that. We're increasingly going to read biology across many, many frequencies, and by the way,
we can also increasingly read biology at higher and higher resolution, which means you could read
all of this information, not for a single individual, but increasingly from a single cell.
and that's a very different thing
because now you could, for example,
in a tumor, you could understand
how are the immune cells reacting to the tumor cells?
How are the tumor cells reacting to the immune cells?
If you can read biology at that resolution,
we're going to learn a lot more about biology.
Now, when you add to the fact that it's not just omics
being transmitted by the cell that we can capture at high fidelity,
but increasingly we have more censored data than ever before,
more ability to crunch data than ever before.
I think if you look over the course the next 10 years,
it won't be a question of personalized genomics.
I think that will, at some level, be a dated term.
It'll be the question of, you know, can you quantify individuals fully?
And we're getting closer and closer to that.
What would you say, though, I have to ask this because we don't want to be sitting here 10 years from now and asking,
so what did we get wrong 10 years ago, Jorge, when you and I talked about these topics?
Is it possible that Multihomics is also one of these much-hyped things as well?
I mean, we can't predict the future, obviously.
Things play out.
It's always a matter of timing, sometimes, when, not if.
Where are we really on the spectrum of hype versus reality?
Yeah, it's a good question.
I think if we take the last 10 years as any guide, there tends to clearly be sort of two stages,
two phases, two ages.
The first age is using technology to learn, and the second age is to use technology to act.
If we look at what happened with personal genomics is that first age where we took to learn
to gather data ended up being, I think, a lot longer than people probably originally anticipated.
And we're seeing the benefits of how we can act on that, you know, towards the tail end of the last 10 years.
I think it's probably reasonable to assume that if we look over the course of the next 10 years, to your question, the dividing line between hype and reality on something like multi-omics, for me, is really the dividing line and when do we shift from learning from information to acting on information.
So one thing I wondered about, frankly, is the parallels.
And your team talks about this a lot in terms of the parallels between engineering and the engineering phase coming to biology, which is that when it comes to DNA in genomics, the thing that's been most fascinating for me to watch over the last decade.
is that there is a Moore's Law in genomics,
and it's much faster than the regular Moore's Law,
and yet that pacing outcome of practical application
is not necessarily on par with what happened with the semiconductor.
So that's where the analogy really breaks down,
despite an accelerated effect.
So one question for me is,
what is missing in the ecosystem?
Like, is it that there isn't the ability to manufacture?
Is it a missing market, as you alluded to earlier?
Are there missing components or materials?
You know, when I think of the history of innovation,
what still needs to be built out,
in addition to this core fundamental technology
for this vision to come to reality?
Ah, that's a great question.
But first of all, I think the reason why we see
faster than Moore's law trend in genomics
is because the ability to sequence and interpret DNA
is really the confluence of three or four engineering marbles.
You know, if you look at the next generation sequencer,
really what it is is, you know,
you're tracking the history and evolution
of our ability to engineer better microfluidic systems
in other words, to move around tiny amounts of liquids.
All about microfluidics from Xerox.
There you go.
So you're seeing improvements in the ability to engineer better chemistry.
And this is both at the nucleotide level so we can get more efficient reactions.
And at the surface chemistry of the platform,
so you can actually run more and more reactions in tighter and tighter real estate.
So you get more density.
That's a second wave.
The third wave is we have massive improvements in optics.
So if you're going to run a bunch of chemical reactions and very, very, very, very,
small real estate, you need to be able to detect those.
Optical detection.
Optical detection.
So when the reactions that drive sequencing are occurring at such density that they fall below
the pixel detection level of the optics, you can't see the difference between them.
So we had to see improvement in optics.
And then all of that generated data that had to be deconvaluted with advanced computation.
And those are the four factors.
And those are the four factors.
Microfluidics, optical detection, improvements in chemistry, and data. Fantastic.
So that's that revolution.
So now why haven't we seen sort of the out?
output looked the same. The difference there is the output of Morse law is better and smaller
semiconductors. Those could be placed within a system that's been designed by people, like human beings,
that could be optimized, and therefore you can get new products. In the case of genomics,
the output of that information has to go into a system that was not designed by human beings.
It was designed by nature. So Jorge, bottom line it for me. So in this journey, from looking
backward and looking forward where are we in the personal genomics revolution and what should
our takeaway be we're still in the early days of this revolution if we look over the long course
of time we are still very much in the learning phase in the data collection phase in the
information gathering phase and it will be sometime before we make a mass shift into the taking
action phase or into the productization phase from all of this information but when you look
where we are heading, that day will arrive, and that's why we are incredibly optimistic about
what the future of genomics, of multilomics, and biology more broadly will bring to our benefit.
And when it comes to the privacy, in its full iteration, we will get the maximum power
from genomic information when virtually everyone is sequenced. If we have perfect information,
we can theoretically draw better insights. But that will come at important cost and considerations
for how we treat the concerns of individuals
that are contributing to that data
in a way that you're still protecting the individual
but still benefiting the comments.
Thank you for joining this episode.
My pleasure. Thank you for having me.