Behind The Tech with Kevin Scott - Dr. David Baker: Director of the Institute for Protein Design, University of Washington

Starting point is 00:00:00 Generally, in engineering and biology, if you want to solve a problem, you go look in nature for a protein that already does something similar, and then you modify it a bit. The relationship between sequence and structure has been really mysterious, but now we can actually design proteins with intent to do new things. So I think it's just getting to Behind the Tech. I'm your host, Kevin Scott, Chief Technology Officer for Microsoft. In this podcast, we're going to get behind the tech. We'll talk with some of the people who've made our modern tech world possible and understand what motivated them to create what they did. So join me to maybe learn a little bit about the history of computing

Starting point is 00:00:42 and get a few behind-the-scenes insights into what's happening today. Stick around. Hello and welcome to Behind the Tech. I'm Christina Warren, Senior Cloud Advocate at Microsoft. And I'm Kevin Scott. Our guest on the show today is Dr. David Baker. Dr. Baker is a professor of biochemistry and the director of the Institute

Starting point is 00:01:05 for Protein Design at the University of Washington. His research group is focused on the design of macromolecular structures and functions. Yeah, I really do think that Dr. Baker is doing some of the most interesting work in the world right now. I for sure, I mean, without question, I would be choosing to study computational biology if I were a graduate student again today. It's just so fascinating what he's doing. And I can't wait to hear him talk about what he's doing and to hear the two of you interact. So let's chat with Dr. Baker. Our guest today is Dr. David Baker. Dr. Baker is a biochemist and computational biologist who has pioneered methods to predict and design three-dimensional structures of proteins. He is the director of the Institute for Protein Design and professor of biochemistry at the

Starting point is 00:02:08 University of Washington. Dr. Baker has received awards from the National Science Foundation, the Beckman Foundation, and the Packard Foundation. He's published over 500 research papers, been granted over 100 patents, and co-founded 11 companies. Welcome to the show, David. Thank you. Happy to be here. So for listeners who aren't familiar with your work, can you tell us a bit about the University of Washington's Institute for Protein Design and the work that you do there? Yeah, sure. So in nature, proteins carry out essentially all the important functions in our bodies and in all living things. And they've come through billions of years, millions of years of evolution, and sort of been optimized to solve

Starting point is 00:02:50 the problems that were at hand during evolution. And what we've figured out at the Institute for Protein Design is how to make brand new proteins and to design them not to address problems that were relevant during evolution, but to address modern day current problems. And that's really what we're focused on at the Institute. It's such cool work. And I'm interested how you got interested in the field. Were you interested in science and math when you were a little kid? Were your parents scientists or engineers? Like what sparked the interest that has carried you to where you are today? Yeah, it's interesting. I actually was not terribly interested in science as a kid, perhaps because my parents were scientists. And when I went to college, I actually did not

Starting point is 00:03:38 initially major in science. I was initially a social studies major and then wanted to be a philosophy major. And then it was really my last year as an undergrad that I switched to biology. And then I hadn't really done research before, but I got really excited about what I had learned, what I had been learning in biology, which was developmental biology and neurobiology. And so I decided I would try out graduate school to see what it was like. And I found I liked doing research and I liked, you know, interacting with other people and sort of to solve hard problems. But as time went on, as I was a graduate student, then the next stage in my career, postdoc, I got more and more interested in sort of more basic questions.

Starting point is 00:04:16 And when I had taken my first biochemistry class when I was a senior in college, they had talked about the protein folding problem. And it seemed really interesting, but everyone said it was too hard to work on. And so but then when I as a postdoc, and then really, when I came to the University of Washington, that I decided to focus on trying to solve that problem. Yeah, I remember, I think in 1993, I was a research intern at the NCSA at the Beckman Institute, working in the computational biology group that Shankar Subramaniam was running there at the time. And he was working on protein folding. And I remember how hard it was in 1993.

Starting point is 00:04:52 So it's, I mean, you picked a challenging problem to get inspired by. Yeah, well, I always tell my students, even when you're in a group, you have to pick hard problems that aren't solved. Those are the only ones really worth trying to work on. And so at that moment when you were thinking about being a philosophy major and you decided to switch to biology, was that a magazine article you read, like an inspiring teacher, a movie you watched, a book, or just plain curiosity? It was, well, I think it was a little bit sort of getting fed up. or a movie you watched, a book, or just plain curiosity? It was, well, I think it was a little bit sort of getting fed up. I mean, I started realizing that more and more of those questions were sort of issues about language and language games and that there wasn't really, it started seeming to me

Starting point is 00:05:38 that there wasn't really a way to make consistent forward progress like discovery. I've always liked trying to discover things. And then when I took the biology class in contrast, there were just all these new discoveries right and left. People were just working out how the principles of developmental biology, and it just seemed like this huge unexplored territory that was much more ripe for exploration than philosophy would have been. So you made the switch and you went to graduate school at Berkeley and UC San Francisco. And how important was the computational part of biology when you were in grad school?

Starting point is 00:06:17 Did you have to learn not just biology, but a little bit about computer science while, you know, while you were a student, or did you have to learn that stuff later? Oh, yeah. Well, so I was a graduate student in the lab of Randy Sheckman, who worked on sort of cell biology, how proteins get moved, sorted around in cells. And at that time, well, I was famous and outspoken for ridiculing anyone who sat at a computer because, you know, what people use computers for, they was pretty rudimentary. You know, it was kind of word processing and, you know, you could you could waste time on computers the same way you can waste time on them now.

Starting point is 00:06:55 So I did not do any computer programming. And really, my major my major interaction with computers, like I said, was ridiculous to anyone who touched one. So, you know, in a little while, I really do want to get into like what changed between then and now, because, you know, I think we'll talk about soon how much of the work that you're doing happens by influence of some sort of digital technology. It's still, I mean, like still the biology is what's driving, but computers play a big role, right? Yeah, absolutely. I mean, yeah, we're probably one of the biggest users of computers in all of academia.

Starting point is 00:07:37 Yeah, which is so cool. But so what did things look like after you graduated? Like what were you working on? And like what did that stage of your things look like after you graduated? Like, what were you working on? And then, like, what did that stage of your career look like? Well, let's see. At the end of my PhD, I sort of discovered how to recreate in a test tube a very complicated biological processes, the ones that were involved in sorting these proteins to where they need to go in cells. And it seemed to me it was going to be a very long, complicated process to sort of purify all the proteins out and figure out their mechanisms. And I had been getting interested in sort of more basic questions about, you know,

Starting point is 00:08:10 the structures of proteins, which I really didn't know much about. And so then when I, for my post doc, I went to David Agard's lab at UCSF, and he studied structural biology and protein folding. And the first day when I went in, there was this computer terminal on my desk. And I asked, what was that for? And he said, it's for computing. So then I had to kind of learn what that was. So that was one of the things I did. But I do have another sort of funny story about that time, which is that whole area

Starting point is 00:08:39 of the building was really devoted to crystal structure determination, determining the atomic coordinates of proteins. So one of the first days I was there, I went into the room where everyone was sitting at their workstations trying to trace a protein chain through an electron density map. And so I said, Oh, can I try? So I said, Sure. So I sat down on the screens and tried to trace the chain through. And I remember then that I have horrible 3d visualization capabilities, I was totally incapable of visualization capabilities. I was totally incapable of doing it. So everyone in the room turned to me and said, David, shouldn't you have checked out whether you're any good at this stuff before you went into

Starting point is 00:09:11 structural biology? That is funny. So maybe you could explain to the listeners a little bit about what exactly that looks like. So like you get a protein, you try to crystallize it, like at the time you were shining a bunch of x-rays through it and sort of looking at the diffraction patterns of, I mean, like I'm probably describing. That's right. No, you're absolutely right. But so like that's just an incredible amount of work just to figure out what a protein looks like when it folds itself up, right? Yeah. Yeah. It's a lot of work and it's very difficult too, because you have to coax the proteins to crystallize. And I don't really, I don't think I, this is probably would be,

Starting point is 00:09:54 it's probably not exactly how it was, but at some point around there, I decided I really wanted to learn how to predict protein structure from sequence so that you wouldn't have to do all this crystallography and chain tracing and all this stuff but yeah and so yeah so so like talk talk about that a little bit like it's sort of a big leap i i guess from this idea that you're using a bunch of scientific tools in the physical world to try to spy into the world of biology much of which we can't see at all, you know, at this molecular level that you're, you know, that you're working at to trying to go from this sequence of amino acids and say, like, if I understand the sequence of acids, like, what is this thing going to look like?

Starting point is 00:10:38 And what a protein looks like dictates a lot of what its function is in an organism, correct? Yeah, you're absolutely right. There couldn't be anything more different than the two processes which have the same end goal, which is each protein, each gene in our genome encodes a unique protein that carries out a unique function. And it does so because the DNA sequence in the gene encodes a brick, their sequence of amino acids, which then folds up into a unique three-dimensional structure. So it had been known since the 60s that the amino acid sequence of a protein determines its precise three-dimensional structure. But like you said, the way that people actually have been

Starting point is 00:11:14 figuring out what the three-dimensional structures are is not reasoning from the amino acid sequence, but instead building hundreds of million dollar equipment and x-ray beams and all this complicated stuff to try and work out what the coordinates of the atom, where the atoms are. But all the information we know is in the amino acid sequence. And so then when I moved to the University of Washington, that was really the problem that I decided to focus on, like to really look at the simplest possible cases of protein folding and understand how they work and do that both with experiments to try and understand what the steps what what the process the key key aspects of the process were and what the determinants of protein folding were and then on the computer to try and develop methods for actually going straight from the sequence to the structure and so why is it

Starting point is 00:11:59 important to be able to predict the structure of proteins using a computer or any other mechanism. Yeah. Well, the same reason it's important to be able to determine the structure of proteins is because, like I said earlier, proteins carry out essentially all the important functions in our bodies and in everything in life. And so if you want to understand how biological processes work or how disease comes about, You have to understand the interactions between proteins. And those are kind of, a lot of them are kind of like Glock and key where things are fitting together very precisely. So you have to understand the geometry, how these structures fit together. And also if you want to understand how they work, like how they, how they generate,

Starting point is 00:12:39 they capture solar energy and convert it into formation of chemical bonds, all those things, you really have to understand the structures in the same way that if you want to know how a machine that does any arbitrary thing works, you really have to know what it looks like. Yeah. So we may be at a really unusual point in time where people probably know a little bit more about like protein structure and what its implications are than they ever had before because of the COVID-19 pandemic.

Starting point is 00:13:09 Correct. So, you know, maybe in terms of SARS coronavirus too, like, can you describe? Yeah, that's a great idea. Really good suggestion. In fact, now when I give talks, I explain protein design in the context of coronavirus. So let me just spend a couple minutes describing what we've been doing at the Institute with regard to coronavirus. So the genome sequence was determined and made available at the beginning of last year. So we took that amino acid sequence and used the methods we've been developing to predict the three-dimensional structure of the protein on the surface, the spike protein. Of course, you're right. There's higher literacy about this now

Starting point is 00:13:48 than there ever was. And we knew that the spike protein found the ACE2 receptor on the target cells. So starting initially with that model and then shifting over to the x-ray crystal structure when it was determined of the spike ACE2 complex. The first thing that we did was to design small proteins that we predicted would fold up into such a way that they'd have a shape and chemical complementarity to the part of the spike protein called the receptor binding domain that binds the ACE, binds ACE2. So these are like, I talked about sort of lock key interactions. So if you imagine the ACE2 is the key and the RBD is the lock, so it's sort of the spike protein goes and binds to the ACE2. We basically made things that would compete away that interaction that is bind more

Starting point is 00:14:38 tightly to the virus than ACE2. And we were able to make compounds that bind to the virus about a thousand times more tightly than ACE2. And this was really cool. They were just completely made up proteins, completely unrelated to anything that had been seen before. And with our collaborators, we were able to actually determine experimentally how these small proteins bind to the spike. And they bound basically exactly like in our computer model. So that means we could go from essentially from the sequence of a virus to these very, very tight high affinity binding proteins. And the next thing we showed was that those proteins block the virus from getting into cells. And then we showed with collaboration that they protect animals from infection by the virus. And I think this was kind of a real aha moment for me because we'd been developing these methods for designing proteins over the years.

Starting point is 00:15:27 And here in the midst of a pandemic, we were actually able to apply them to make therapeutic candidates. And those are now headed for clinical trials. It's been slow because this is a completely new modality, this whole idea of computational design proteins. So there's been a little bit of a pushback because these are completely new things. No one knows how exactly how they'll behave. But for the next pandemic, we're going to be ready. So we have all the methods worked out. And I think we've gotten a lot over a lot of the sociological issues to actually using these as drugs. And there's nothing really that can be as fast if you can go from the amino acid sequence to actually computing a

Starting point is 00:16:01 protein which fits perfectly against the virus. So that's the first thing we did. The second thing we did was to design, again, completely from scratch, little molecular devices that emit light, luminesce, when they encounter the virus. And those are pretty neat. We're developing those now for not only for detecting the virus, but also for monitoring responses to vaccination, like how good are my antibodies against the virus? And so rather than that being just like a fixed key that fits into a lock, that's actually a device that can undergo changes in its state when it encounters the virus. And the third area, my colleague Neil King at the Institute has been developing sort of a next generation of coronavirus vaccines using designed protein nanomaterials that we've created at the Institute, has been developing sort of a next generation of coronavirus vaccines

Starting point is 00:16:45 using designed protein nanomaterials that we've created at the Institute, which self-assemble into big things that look like death stars. And we can put the parts of the coronavirus spike on the surface. And when Neil does that, it finds it gets very, very strong immune responses, stronger than with the current vaccines. These designed nanoparticle vaccines are now in clinical trials. So that sort of illustrates some of the key areas in protein design now, being able to design very precise shapes that can block, combine very tightly to targets, being able to design molecular devices that can undergo, that can basically do logic calculations, and being able to design nanomaterials like these protein death stars. Yeah, to me, this is some of the most incredible stuff that I've ever seen in my life.

Starting point is 00:17:30 It's just amazing to me that you can go from a published sequence for this virus, simulate a compound that has this incredible binding affinity to the RBD, like in this computational domain, and then, you know, just be able to go from there to like diagnostics, vaccines, you know, like potentially therapies, like I'm guessing even that you could use these same techniques to try to assess how effective a human antibody response might be to the variants of the virus. Yeah, we're actually doing that now. Yeah, that's it. Yeah, you know, it sounds kind of science fiction-y to me, to you. It also still sounds that way to me, which is why, I mean, this field has been moving so fast. We've been able to

Starting point is 00:18:22 make so much progress to the things we can do now that still are a lot of work, but I don't think I would have thought would be possible just even, you know, several years ago. And that's what's been exciting. I mean, people always ask me to predict the future of the field. And I always say, well, my biggest hope is that it will move so fast in such new directions, I can't predict it. And that's actually what's been happening pretty continuously. Yeah. And so what are the big things that are moving progress forward right now?

Starting point is 00:18:47 Like what is, I mean, like you can sort of take a long view, right? Like I think the landscape was very, very different when you started than it is now. Yeah, it's totally different. What are the big things that have? There are just, there's so many things that are coming together. And I think when you have technological revolutions, that's generally what's happened. There's just been a lot of things that come together. And I think we just happen to be at the right place at the right time. So what are they? Well, our understanding of the basic principles of protein folding have been improving over the

Starting point is 00:19:18 years. And we've been developing this computer program called Rosetta to model these for 20 years now. So we've been doing a lot of work and sort of trying to get more and more of the details right. Second is computing power. These things are, despite my ridiculing people as a graduate student, these calculations now are, these systems are really complicated. And the basic principle is that proteins fold to their lowest energy states. So if you want to design a sequence that folds to a new structure, then you have to search through this huge landscape, and it's very time consuming. So the fact that computers have just gotten more and more powerful continuously has really

Starting point is 00:19:53 opened up. And we couldn't have not have done what we did for coronavirus in that amount of time with computing even of 10 or 15 years ago. And the third that's really completely independent is because of the genome project, there's been this huge advance in gene synthesis technology. So we design on the computer, we can compute, you know, millions of different possible proteins that are, you know, all possible creations, but actually bring them into the lab. We need to encode them in synthetic pieces of DNA, synthetic genes. And after we do that, we can put them into microorganisms. Who will then produce them? But synthesizing DNA is still expensive, but it used to be hundreds of times more expensive.

Starting point is 00:20:34 And so now we can routinely design 100,000 brand new proteins for all kinds of different applications. I mean, the range of things we're doing now in the group, you know, trying to create plastic, trying to capture solar energy, trying to do exotic chemical reactions, all sorts of things. You know, this just would not have been possible, again, with the technology of 15 years ago. So, being able to manufacture DNA very rapidly and cheaply is really enabling us to move forward quickly. And these things all go together because since we can compute, since we understand the principles, we can compute very large numbers of designs

Starting point is 00:21:09 because we can manufacture so many genes. We can then bring them all to life and then we can do measurements to see which ones work and which ones don't. And then we can fold that back in to improve the methods. And that brings up the fourth thing that's coming in, which is deep learning. And deep learning has just advanced by, you know, in the last 10 years has just

Starting point is 00:21:28 been incredibly, has advanced as fast as the other areas, if not faster. And so now we're complementing the Rosetta picture folding, which is sort of this physical model where you have this protein chain folding up with more of a deep learning approach, which is basically looking for recurring patterns and relating sequences to structures using those. And that's turning out to be very, very powerful. We can also use machine learning to try and interpret, to basically relate all this data that we're collecting to improving the computational models. And the fifth thing that's happening is that for the first time, these de novo design proteins are being developed as drugs. So there were probably four or five different proteins that we've designed, which will be in clinical trials this year. And so it's,

Starting point is 00:22:09 we're kind of at this inflection point in all these different areas. And so, you know, we don't really know how they'll turn out, but if one or two of them, you know, actually work pretty well, then you can imagine that, I mean, we'll have de-risked this whole kind of platform. And I think that the things that way engineering and biology has worked up until now is not intentional in this way. If you want to make a drug that binds to something, you either just screen a huge random library of compounds or you try and coax an animal to make an antibody against it.

Starting point is 00:22:38 There's no, generally in engineering and biology, if you want to solve a problem, you go look in nature for a protein that already does something similar, and then you modify it a bit. The relationship between sequence and structure has been really mysterious. But now we can actually design proteins with intent to do new things. So I think it's just getting to be very exciting. There's so much that you just said that I find fascinating and I want to follow up on, but like one of the things that as a computer scientist, that wasn't super obvious to me as I started diving deeper into biosciences is the extent to which you are,

Starting point is 00:23:15 we're now at this point where you can actually use biological machinery to go do work. So like what you just described a minute ago is you want to synthesize a protein, but you don't synthesize the protein by like having some 3D printer that just lays out a bunch of amino acids. Like you program a little piece of genetic material to use the cellular mechanisms that turn DNA into the protein that you want. And like, that's sort of like, that's just fascinating to me that we understand enough

Starting point is 00:23:47 about some of the basic biological machinery that you can leverage that to do some of the work for you. Yeah, I left out, there's a sixth thing I should have added to my list, which is exactly what you're saying. The whole recombinant DNA protein expression thing, our advances understanding basic biology, which we play a lot

Starting point is 00:24:05 of, we use biology right and left when we're, when we're making these proteins and then we're screening them for activity. So you're absolutely right. That's another thing. It's like all of these different technologies have come together. Yeah. And so I'm, one of the things that you, you know, that you've described is the, the end state of is uh de novo drugs uh like being able to design therapies for diseases like covet 19 and like in the limit like what you would be able to do what you'd like to be able to do you know for cancer therapies for instance is you would like to be able to look at like the very specific uh cancer mutation that you have, like making someone ill and be able to custom tailor a therapy to like the very particular illness that you have. And in order to

Starting point is 00:24:56 like, there's sort of two axes that I think are interesting. Like one is upper respiratory viruses are probably a little bit better understood than cancers. But like the trick with them is like you need to go very fast. So you need to go from new virus to, you know, therapeutic for it to drug that you can start delivering. And like the amount of time that that takes dictates like how bad, you know, a public health crisis you're going to have. For custom tailoring a therapy to an individual, like what you're trying to drive down is cost. So like we have some of these therapies right now, but it may legitimately cost a million dollars to synthesize the treatment just because it's very complicated. And, you know, you, you have five people who have the sickness a year.

Starting point is 00:25:47 So like, do you see both of those things getting better? So like time to, you know, deliver a therapy or like the cost of delivering a therapy or those both getting better? Yeah, I think there's also the precision of the therapy. So you brought up a number of really good points, but one of the issues with current protein drugs, which is why they're so expensive, is their antibodies.

Starting point is 00:26:10 They're very complicated proteins. Antibodies are what we make to defend ourselves against disease. And so naturally, in the sort of spirit of sort of emulating what's in nature, when pharmaceutical companies want to solve a problem now, they try and make a new antibody. And then that's the drug. The problem with that is that antibodies are very expensive to manufacture, and that contributes to the high cost. These proteins that we design, in contrast, can be manufactured for a hundredfold lower

Starting point is 00:26:35 cost. They can be made in bacteria, not in complicated mammalian cells, which are much more sensitive and much more difficult to grow. As far as precision goes, antibodies, the way they work is they're kind of blunt instruments. They hone in on a target. They're basically just, they're like the coronavirus binder I described. They just bind. We can, with design protein, we can actually design logic systems that actually can go

Starting point is 00:26:57 into the body and pick out cells that have combinations of proteins on their surfaces that are indicative of disease. Because in some cancers, there isn't really a single distinguishing mark. There's a number of things or some things are higher and some things are lower in abundance than they would be in normal cells. And you need to be able to resolve those differences. And so you need more sophisticated types of more sophisticated drug when they, like I said, can sort of do logic calculations in the body to really approach that. And in terms of time, you know, we're not there yet. But since everything's going on in the computer, then in principle, the reaction time should be much

Starting point is 00:27:35 faster, say to a new pandemic than it is if you have to. So with the antibodies, the antibody therapies for COVID-19, they weren't designed on the computer or designed by anybody. Instead, there was a lot of searching after and in the bodies of people who had been infected with the virus or animals for antibodies that happened to bind to the virus and really be effective at blocking it. And it's pretty rare that you find those antibodies. So if you can design things by intent, I mean, it's kind of like the way that biological drug discovery and engineering has worked. It's sort of like the way that biological drug discovery and engineering has worked. It's sort of like you're trying to build a building, you keep throwing a pile, you know,

Starting point is 00:28:08 bricks into a pile and you hope it assembles into a building. Well, it's much better if you understand the principles of construction and can just build it. So I think the future will be very bright. Yeah. And I'm just always reminded how complicated these biological systems are. Things are very rarely as neat as you as a computer scientist or a mechanical engineer have. You think, I will say for myself as a computer scientist, I often think that I'm dealing with complicated systems.

Starting point is 00:28:40 But compared to biology, our artificially engineered digital systems that we're building are like nowhere, even remotely as complicated as a biological system. One of the things is, you know, I think we've all looked at these vaccines as you have two types of vaccines now that are prevalent. You have sort of the mRNA vaccines that we are very excited about, and you have things that look more like classical vaccines. So they use something like an adenovirus to carry the spike protein into the body to try to produce an immune response. But one of the funky things about adenoviruses, right, is like if you already have antibodies

Starting point is 00:29:27 for the adenovirus itself, like the antibodies may swarm in and kill the vaccine before it can produce like the new immune response. So I mean, I'm just sort of wondering, yeah, one of the things that you said that's really fascinating to me is I think we very quickly, especially by classical vaccine development standards, went from the genetic sequence of this virus to vaccines or therapeutics. How long did it take you to design this thousand x binding thing once you had a sequence like a month two months it was probably it was probably um it was a few months but that was because we hadn't done it before and we had to sort of learn by doing and i think now we're working on methods for really doing that what we'd like to do is be able to do that within two weeks so the sequence of a new pandemic threat comes out and then two weeks later

Starting point is 00:30:24 we have a really high affinity antidote. That's aspirational, but I think it's possible. Yeah. And like, you know, everything that I've seen seems what, like, you know, far better than I do, but like, it sounds like a reasonable aspiration to me. Like the thing that I don't understand as well is then the hard part starts, which is trying to assess the safety of the thing that you just synthesize and like that's you know even with the mrna vaccines that we uh that we made like we had a pretty good

Starting point is 00:30:51 idea that they were going to work uh like we just didn't know that whether or not they were going to be safe yeah um and so how how do you i mean like can we use the techniques that you are describing to like do some of that safety assessment to drive the time? Yeah, it's a really good question. And that is probably one of the biggest question marks currently. So what we can do is on the computer design in properties that we think will correlate with safety. But humans are very, very complicated.

Starting point is 00:31:20 So it's hard to predict exactly what will happen. And so I think that is evaluating safety is the thing that is slow. And that actually is what has been taking all this time much longer than the design. So it's a good question. And there, we'll just have to learn from experience, I think. So what are you most excited about over the next handful of years? And like, what do you think we as citizens, folks who are very encouraged by what you're doing, who want to help things go faster,

Starting point is 00:31:51 what can we do to get some of this goodness that you're building moving faster? Well, let's see. I've been very interested in involving the general public and scientists at large in what we're doing. So we have an online game called Foldit where we post a lot of the current problems we're working on. For example, now one of the Foldit puzzles is that we've designed small proteins, which bind to different parts of the coronavirus. And now Foldit players are being challenged to connect

Starting point is 00:32:17 them in just the perfect way. So you get sort of the much stronger binding. So Foldit is one way. And then we have a project called Rosetta at Home, which is how we do a lot of our computing. If we send out jobs to people's computers and they compute new design proteins and send them back, and then we select from those which ones to make. So those are two of the ways in which people can get involved in what we're doing. Yeah. And like, in my opinion, like that's a much more beneficial thing for like our collective human wellbeing than like using your spare GPUs

Starting point is 00:32:51 to do cryptocurrency mining, for instance. Yeah, that's right. So, you know, just in terms of your work, like what are you really excited about over the next handful of years? Like how do you prioritize your energy among like all of the many, many, many things you could go explore? Well, there's a magic ingredient to all of this, which I haven't really talked about yet,

Starting point is 00:33:18 but that's part of the answer, which is that the most brilliant people in the world, graduate students and postdoctoral fellows are now coming to the Institute for Protein Design to sort of push the next wave of discoveries and make their fortune and start out exciting new independent careers in this area. So what I actually do every day is I talk to these absolutely brilliant people in my group about, you know, new people come in and, you know, we sort of brainstorm different ideas. They go and talk to everybody. I have sort of a theory of scientific creativity, this idea of a communal brain where everyone's talking to everybody all the time and like connected neurons, you can get these really emergent things.

Starting point is 00:33:55 So in terms of the new areas that we're going into, there's really, it's very wide and it's driven in part by the interests of people come in. But some of the really exciting areas are, well, really sort of pushing ahead on this very rapid therapeutic design, making better vaccines, biological machine, you know, designed machines, nanomachines that can do work. And for example, you know, we're making rotary motors now, kind of couple them to energy sources.

Starting point is 00:34:24 We're really excited about making new types of materials. So in nature, there are things in nature, we have examples like bone and tooth and seashells where it's proteins interacting with inorganic compounds to create all sorts of, you know, hybrid structures. And I think that's a really rich area. What if those things that you were interacting with were semiconductors rather than, you know, calcium carbonate, you could, you know, the sky's the limit there. Catalysts, trying to catalyze chemical reactions, which don't exist. We have a big effort now in exploring molecules that really nature couldn't explore because they're made out

Starting point is 00:34:56 of more exotic stuff than just the 20 naturally occurring amino acids. So new classes of compounds, we can take the same computational methods we've been developing and apply them well beyond outside proteins. And we're making systems that can sense the environment, respond. You know, I could go on and on. I'm excited about it all. Well, and it is sort of super exciting. And like going back to the intro like your your bio like you've started you co-founded 11 companies and so you know i i think beyond the yeah the fact that this is all scientifically some of the most interesting stuff in the world right now and it has this huge potential for you know impact in the same way that great scientific discovery usually does like you

Starting point is 00:35:43 also are like at the epicenter of this new entrepreneurial engine so like you're like in a lot of the ways like silicon valley uh like you go to a silicon valley school and you know like a bunch of professors there starting companies and like it's this vibrant ecosystem like i'm really excited and uh to see this happening with what you're doing yeah that's a really important part, because I told you with all the many brilliant people in their 20s are coming here to do great things. Many of them, it used to be they all wanted to go on and be professors. Now many of them come with the idea of starting companies, and many of them are doing that. I mean, I think

Starting point is 00:36:19 we'll probably be spinning out three more companies this year. And it's just, it's sort of accelerating. There's so many different things that proteins can do. And that's great because it is creating this whole new ecosystem. It's people and, you know, the people who don't want to start companies or take faculty positions instead are taking jobs at these companies. And the companies can then really go much deeper into the various application areas. We're kind of like a discovery engine, the Institute for Protein Design, but then actually bringing things out

Starting point is 00:36:50 in the real world is a whole other effort. And so that's what our spin-out companies are doing. Yeah. And then I also really have to think, the reason you didn't really touch on this, but what's really also fueling this is philanthropy, because, you know, what we're doing is so new, it's almost impossible to get grants to do this. And so a lot of a large fraction of our work is supported by private philanthropy, people sort of see this as that this really is the, you know, a technological revolution, and that pushing it forward rapidly will, you know, lead to all kinds of great new things for society. And we're really completely indebted to the people who are supporting our work.

Starting point is 00:37:29 Yeah, which I wish we could channel more funding. Like if I had a magic policy wand to wave, I would 10x, 100x the level of collective investment from philanthropy to government funding to corporate funding that was happening with this stuff. Like the return on those investments, both financially and in terms of goodness for society, like better health care for us all, for instance. I think it's just tremendous. So what would be your advice to students who are thinking about this as a career path? What should they study? What should they learn? It's it seems very interdisciplinary to me that, you know, it's

Starting point is 00:38:09 chemistry, biology, computer science, entrepreneurship, it's like just a bunch of things. I would say be a generalist is I mean, that's what you can sort of see from my trajectory. There's not, you know, the tech, this field is changing so fast that even if you were an expert in protein design today, five years from now, you would probably have to relearn everything. So yes, I think getting it's exactly the same. There's biology, there's physics, there's computer science, there's some sociology, you know, we live in the world. And so I think it's important to, yeah, to get a broad education. And, you know, and actually, when I look back at my own education is for a long time, I thought that, well, what the hell does social studies or philosophy

Starting point is 00:38:51 have anything to do with what I do now, but actually, it's turned out to be very, very useful in a lot of ways. So anyway, so I think that's in terms of a broad education. And then, you know, to actually, if you actually want to start doing this kind of work, being joining a research group, where in some capacity where this where such work, joining a research group where in some capacity where such work is going on is really, there's no substitute for that. Yeah. Well, so we're almost out of time here. The last question I like to ask everyone, and it's a weird question because everyone I chat with has such interesting work that they're doing in their professional lives, But I'm always curious

Starting point is 00:39:25 what you do for fun or what you find interesting outside of your professional life. Oh, yeah. Well, I love the mountains and are just going out, getting away from things, whether it's on the water or in the kayak or climbing or skiing or hiking. It's really what I love to do. During the week, there's a lot going on and a lot coming in and I need a little and having time just out in open space to process every once in a while is really great so it's yeah and you live in the pacific northwest like one of the most beautiful places in the world uh to be outside hiking or biking or kayaking or whatever you're that's right that's right and that's that's that's part of the reason I'm here. So, yeah.

Starting point is 00:40:06 That's awesome. Well, this is amazing. Like, thank you so much for taking the time to chat with us today. And more importantly, thank you for what you're doing. Like, I don't know whether you think about your work is like this great public service, but it very much is.

Starting point is 00:40:22 And so on behalf of many of us, like, I just want to express real appreciation and gratitude for the work that you and your students and your colleagues are doing. Well, thanks, Kevin. This was a lot of fun. And honestly, I'm not a surfer, but I just feel like we're riding the wave and we want it, you know, the longer we can ride it, the better hope that we can make the world a better place. That's really what we're trying to do. So yeah, this was a lot of fun. I enjoyed our conversation. Awesome. Thank you so much. Well, that was Kevin's conversation with Dr. David Baker. So what stood out to me about your

Starting point is 00:40:57 conversation was just how fascinating everything that's happening in this field is. Like, I honestly, I'm going to be honest with you, even after you said that this is what you would study in grad school, if you're in grad school right now, in my mind, I'm like, I'm not a traditional sciences sort of person. I don't know how fascinating any of this will be. And I'm just riveted by all the cool stuff that is happening in this space. Yeah, it's really crazy how fast things are moving right now. I think one of the things that he said that was really interesting and very, very true is that scientific revolutions, technological revolutions tend to happen when you have multiple things that are moving very quickly and have had their own transformations come together at the same time.

Starting point is 00:41:47 And he named off six of those in the course of our conversation. And yeah, it's one of those places where because it's moving so rapidly and because every day we're learning more and more and more about how to push things forward even faster, this is just where so much of the interesting things are going to be happening in the world over the next couple of decades is in this particular discipline in digital biology, synthetic biology, computational biology, where we really are trying to get this like high resolution and more accurate picture of what happens in biological systems and then being able to engineer those systems to solve really, really tremendous problems. Yeah, no, I mean, that's what struck me. I mean, like you said, he was talking about how all these different things kind of need to coalesce to have this real innovation. And when he was saying that some of the stuff that the industry has been doing and the scientific community has been doing with coronavirus wouldn't have been possible with

Starting point is 00:42:59 computers and technology from a decade or 15 years ago really struck me. And when he was talking about how the way that the engineering and the technology is working now is that it's intentional. Whereas before, you know, you'd look to nature and now you are intentionally and having this intentionality about saying, these are the problems we want to solve. And we're going to engineer a solution rather than looking for something that might already exist that can solve that problem. Yeah, I mean, in a very real sense, like many of these areas in biology now have a real engineering discipline that goes along with them in addition to the normal scientific practice.

Starting point is 00:43:40 Like we understand enough about the systems where we can start designing them in ways that serve purposes that are different than the ones that nature strictly designed them for through the process of evolution. And like, to me, that is the most fascinating thing. interestingness about this particular moment is we are all at a heightened level of awareness about both the upside and the downside of biology because of the COVID-19 pandemic. Like we know what one little molecule can do to impact hundreds and hundreds of millions of lives, and, you know, just sort of trillions of dollars worth of economic damage. And like, we also know that we can harness this greater understanding that we have about biological systems to, in an unprecedented way, design therapeutics and vaccines for the pandemic itself.

Starting point is 00:44:40 And like, all of that just gets better in the future. It does. And it makes it exciting. I mean, it's a little bit harrowing in some senses. And if you wanted to become really dystopic about it, it could be, you know, sort of concerning in that respect. But it's also so exciting, because it really does feel like the things that we've thought about in the past of being science fiction are really going to be things that not necessarily, right? Like that actually could be reality, which is, I'm going to be honest, I think if nothing else, I mean, I think it's exciting, but if nothing else, it's really interesting.

Starting point is 00:45:13 Yeah, I do think it's exciting. You know, I keep saying that I'm an optimist, even though, you know, my wife would look at me and say, I'm a grumpy a grumpy old cynic. But I'm very optimistic about the choices that we will make about how these tools and this greater scientific understanding of biology will get used. And I think you can absolutely see it with COVID-19. We have just had our scientists and engineers come together in exactly the way that you would hope to just pour time and energy and expertise into getting us past the misery of a global pandemic as quickly as possible. And I think not just the science has been a shining light, but the way that we've come together as a species has been a shining light for me. I agree. I agree. It's actually been pretty remarkable to see and to see what's happened in this relatively small period of time, all things considered. Anyway, this is a great conversation.

Starting point is 00:46:27 I love hearing about what Dr. Baker is working on in his work. That's it for our show today. We are so grateful to Dr. Baker for joining us. And you know that we love hearing from our listeners. So contact us anytime at BehindTheTech at Microsoft.com. Thanks for listening. See you next time.

Behind The Tech with Kevin Scott - Dr. David Baker: Director of the Institute for Protein Design, University of Washington

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.