Into the Impossible With Brian Keating - Meta NeuroScientist SHOCKED Me: Scale Alone Won’t Create Consciousness!

Episode Date: March 16, 2026

Please join my mailing list here 👉 https://briankeating.com/list  to win a meteorite 💥 David Sussillo spent years training neural networks at Google Brain and now leads research at Meta Rea...lity Labs. His verdict on the path to artificial consciousness might surprise you. In this conversation, we discuss why scaling transformers alone won't produce conscious AI, how recurrent neural networks differ from the transformer architectures powering today's LLMs, the FORCE algorithm he developed to train RNNs, what his work on Meta's EMG wristband reveals about AI and biological signals, Rich Sutton's Bitter Lesson and whether we're locked into a GPU-transformer paradigm, and why studying the brain may be the only way out of AI's current limitations. David is also the author of Emergence, a braided memoir that tells the story of his path from foster care to the frontier of neuroscience and AI research. David Sussillo is a research scientist at Meta Reality Labs and an adjunct professor at Stanford University. He completed his PhD at Columbia University under Larry Abbott and previously worked at Google Brain. Key Takeaways: 00:00 Scale alone won't produce consciousness — here's why  00:25 The "bitter lesson": big data crushes every algorithmic invention  01:15 How hallucinations were discovered early at Google  05:00 The lock-in problem: why LLM + GPU may cap AI's ceiling  08:25 What Meta's neural wristband is actually trying to solve  11:20 Rich Sutton's bitter lesson explained  14:15 "More is Different" — why you can't reduce complexity upward  28:30 The book Emergence — two stories braided into one  36:00 Why mentorship matters more than intelligence for at-risk kids  47:00 Stability over intensity: what kids in foster care actually need  ➡️ Follow David Sussillo  🌐 Website / Research: Meta Reality Labs  📚 Emergence (memoir): https://www.amazon.com/Emergence-Memoir-Boyhood-Computation-Mysteries/dp/1538768577  🎓 Stanford / Columbia / Carnegie Mellon Alumni ✍️ Email: davidsussillo@gmail.com  🏄‍♂️ Twitter: https://x.com/SussilloDavid Join this channel to get access to perks like monthly Office Hours: https://www.youtube.com/channel/UCmXH_moPhfkqCk6S3b9RWuw/join 📚 Get my books: Think Like a Nobel Prize Winner, with productivity tips from 9 Nobel Prize winners: https://a.co/d/03ezQFu Focus Like a Nobel Prize Winner, with life-changing interviews with 9 Nobel Prizewinners: https://a.co/d/hi50U9U My tell-all cosmic memoir Losing the Nobel Prize: http://amzn.to/2sa5UpA The first-ever audiobook from Galileo: Dialogue Concerning the Two Chief World Systems: Ptolemaic and Copernican https://a.co/d/iZPi9Un Follow me to ask questions of my guests: 🏄‍♂️ Twitter: https://twitter.com/DrBrianKeating 🔔 Subscribe https://www.youtube.com/DrBrianKeating?sub_confirmation=1 📝 Join my mailing list; just click here http://briankeating.com/list ✍️ Detailed Blog posts here: https://briankeating.com/blog 🎙️ Listen on audio-only platforms: https://briankeating.com/podcast #universe #podcast #briankeating #intotheimpossible #science #astronomy #cosmology #cosmicmicrowavebackground #intotheimpossible #briankeating Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 I don't think scale alone, just transformers and lots and lots of data from the internet is going to get you to any form of consciousness. My cat is conscious, it snuggles. It's there, it's present, it's embodied. I was a bet that, like, there was still something to learn from the brain that would inform AI, I have to say, that I wasn't not successful. These things would just generate complete nonsense. If you add three periods to it or a backslash, they would just go crazy.
Starting point is 00:00:24 Every time there's algorithmic invention, big data just comes and crushes it. David Cecillo, neuroscientist, former Google Brain researcher, Columbia trained PhD, who's now at meta-reality lab. So what he told me about the limits of AI might change how you look and maybe feel and fear about what's coming next. Let's go. David Sassillo, welcome down to San Diego all the way up, down from the Bay Area. Thank you for having me, Brian. Appreciate it. You're the author of the second most difficult type of book to review and to cover on this podcast, which is a memoir, the first one being a novel, right?
Starting point is 00:00:59 You can't really get a novel fully described without spoiling it unless it's decades old. But this one, a memoir, is really just a fascinating work of pure love, but it's also a work of science. And we're going to get into all the mysteries and magic of it. But first thing I want to ask is, when I think about, you know, how we're using LLMs and I use them every day and I love them and then I have a great deal of fondness for how they deliver, you know, just incredible products that saved me a lot of time and resources, However, they have a stunning number of lacunae of flaws and foibles and gaffes that they make. And the first one I think about is sycophanty, which I actually like. I like it my AI to be a little bit, come on, come on. You've got to have a little bit of sycophanty, right?
Starting point is 00:01:43 And then the other one is hallucination. So I asked it once, you know, asked Chatshipping, you know, what books is Brian Keating written? Oh, losing the Nobel Prize, into the Impossible, a brief history of time. No, I'd like to have 1% of Stephen Hawking's book sales, but I didn't write that book, right? So is there any hope that, say, R&Ns might be more immune to hallucination or sycophanty, but let's just stick with hallucination, then, you know, other approaches for, you know, the typical approaches that are used in our, you know, LMs and so forth? On the face of it, I sincerely doubt it.
Starting point is 00:02:16 So the hallucinations, we wrote a paper about it in 2018, and we were looking at neural translation using R&Ns. So I don't think so. I think... Can you explain what neural translation? Neural translation is taking English to German, say. And so it's just translation and the neural language. Yeah, natural language.
Starting point is 00:02:38 And it's just you use a neural network to do it, right? And so what we discovered is that these things, I mean, it was at Google, they knew that these things would just generate. This is really early days. Just generate complete nonsense. So you could have, you want to translate into German. Hello, my name is David. If you add three periods to it,
Starting point is 00:02:56 or a backslash, it would just go crazy. It would just go crazy. These things, they've made major, major progress on this since then, right? But I don't see any reason why, in fact, I would expect it to be, Arnans to potentially be worse here. I think it's a very sophisticated problem that, in all likelihood, has an answer that involves many, many hacks, you know. And I don't do that research, so I want to be careful about, yeah.
Starting point is 00:03:21 Are there any particular, you know, in the consumer-facing side of things that the audience might be familiar with? you know, killer apps where R&Ns are superior to other forms of approaching the similar problem. What happened is the transformer happened, right? And so that just for the audience, the transformer is this much more, I want to say sophisticated. It is a more computationally intensive black box that utilizes this idea of attention. And so you can look at just the entire sequence of your input. And so that is a very powerful box. And so, you know, it really kind of coincided with my transition from Google that RNNs became out of fashion.
Starting point is 00:04:05 AI is really heavily dominated by fast-changing fashions. So it would not surprise me at all. There's been lots and lots of work on making RNNs work better, the so-called state-based models now, making these models more powerful, more efficient. So what really is going on is the transformer very cleverly, in fact, was devoutes. was built to work perfectly with a GPU. You're running uphill on that one. Yeah, and I've talked about this before,
Starting point is 00:04:34 but maybe I'll run my theory beyond you and see what you think. That actually, I think that we're in no danger of artificial superintelligence because the architecture that's been so successful is dominating the landscape from consumerism, which is all that matters, really. Attention is all you need,
Starting point is 00:04:51 but actually consumers are all you need for most things because then it'll eventually get propagated through the government, through military and it'll be education, it'll be impossible to extract it. And it's a phenomenon called lock-in. Like the most, the first mover advantage is often, you know, a disadvantage long-term. Like the QWERTY keyboard was designed such that the keys didn't stick together on a typewriter, which you barely know about probably.
Starting point is 00:05:13 I remember typewriters. But if you used, you know, like S&T too often, they would conflict and the hammers would stick together and you'd ruin your efficiency of typing. so they change from the more efficient form of keyboard organization, Dvorak to QWERTY, right? And now we're locked into it. My kids will never know. And your kids, you know, whatever, your students, your mentors.
Starting point is 00:05:33 They'll never know another type of keyboard. And for that reason, the first mover is not always, in fact, it's sometimes the worst solution. Like, you know, there are VHS and DVDs and all sorts of examples of this. My feeling is that the LM plus GPU is such a marriage made in heaven that nothing's ever going to break out of it. it and therefore we're not going to get to the true definition, which I told you about before we started, my definition is the Einstein test. Can an AI predict based on the pre-1911 laws of
Starting point is 00:06:05 physics that were known at that time the data from the planet Mercury could it come up with relativity? If it can't, then, you know, it seems to me it's good at checking things, incredibly useful, but it's not going to be super intelligence, let alone general intelligence, or maybe general intelligence, let alone superintelligence. What are your thoughts about this locking phenomenon? I mean, are we a victim of the success of these GPUs, married to LLMs, which were invented to solve the problem of doom that you and I loved in the 1990s, right? That's right, yeah. Who's to say that this will lead us to this paradise of superintelligence? I'm definitely sympathetic to the argument you're making about being locked in because history matters, right?
Starting point is 00:06:39 The details of the exact random choices that were made a decade ago. I have to say my experience with the AI community and industrial research is if you were to demonstrably prove, that some other architecture were better than they would all switch as soon as they possibly could. Like a major improvement. Of course, the problem there is how do you do it? And so this was a major problem at Google
Starting point is 00:07:05 when I was there. This is all at this point, public information. It's really history at this point. We had this new technology, deep learning. Of course, deep learning is just stacked neural networks. And so we had this research group together, Google Brain. And so we had all these people just innovating and innovating all these techniques.
Starting point is 00:07:21 That's where the transformer came from, right? And so what they did is they brought in all of the more applied research scientists who worked on the different, like the page rank or the neural translation or the ad placement or whatever. And they would just sit in and kind of learn how we were doing things.
Starting point is 00:07:37 And then they would go back and apply it to their systems. And it was, I think, a brilliant strategic move on Google's part to do that. It really, really changed everything for them. And so coming to the point, there was significant uphill battle even showing, That deep learning, which has taken over the world, was actually better than those systems because of this kind of entrenched advantage that you're talking about. So I think it's a real thing.
Starting point is 00:08:03 But for all of that, if you could demonstrably show that doing X, Y, or Z thing would be way better, I think they would eventually pivot. It's interesting because, you know, NVIDIA is a pretty big battleship to turn around, right? I guess, you know, with the TPUs that Google uses different story, right, because they're making their own, and so they can pivot and GROC is making their own. But it's still fundamentally the same idea. Massive matrix multiplications in parallel as fast as you can get them. And so what is meta's interest? I mean, the metaverse, you know, came and went without much fun, fair, but a couple dozens of billions of dollars put on the bonfire.
Starting point is 00:08:36 I had my meta-goggle glasses on in the other room. You can show me the version six later. You promised me exclusive view at those. That's not true. That had the holographic, you know, 3D rent. Oh, so you admit that they do exist. I fell into the trap. the interest in meta beyond
Starting point is 00:08:52 llama and sort of, I mean, they're also a behemoth. So what are their interest in the brain and neural networks? What's your role there and what's their kind of direction and why we should be excited about their future? Yeah, so broadly speaking, I want to be careful here. Yeah. I think just like every other company, Meta realizes
Starting point is 00:09:08 AI is here and they want to make the best products available that they can using those technologies and develop even in better and increased technologies as they do it. For my part, I actually work on a much more applied piece of technology, which is this EMG wristband,
Starting point is 00:09:24 which is meant to drive the glasses. Ah, the oculet, or the glasses now. The so-called neuroband. Right. That's an applied problem whereby you read out the electrical signals from the wrist muscles, and then you decode them to, say, hand right, or do gestures up, down, left, right,
Starting point is 00:09:40 this kind of thing. So taps and clicks and button presses. And so that's an applied problem, but it's an interesting problem because how do you get something to work on every single person on the planet, right? That's actually a pretty deep applied problem. Does it need to, though? I mean, if it works for the billion most successful consumers on Earth, that's a pretty good market.
Starting point is 00:10:02 I'm sorry, so that's, to me, that's the end went up to infinity already. The physicist, cosmologist in me. We think of billions, right? Yeah, that's right. So we actually were able to publish some of this work in nature. And I think that the real take-home message is that from a science perspective, nobody had shown, to my awareness, that like the deep learning recipe would work on biological signals. Right.
Starting point is 00:10:32 And so those kinds of scaling curves, I can't stand scaling law because is it really a law. Those scaling curves are true for humans in terms of reading out wrist movements. The more you had you get in these log linear plots, the better you do on your error. And so that's a real result. And that result has real implications for academic BCI because it means that, now, I want to be careful here, the muscles are a very simple system compared to, say, brain data. But for all of that, your muscle activity is action potentials, right? So it is a neural signal. So it's a very encouraging piece of data point for the program of brain computer interfaces.
Starting point is 00:11:15 So you didn't mention on scale and you talked about this in the book, that bitter lesson, you know, you talk about what is that? What's your beef with scale besides my own beefs is stepping on the scale? Talk about what is this, the limits of scale? Bill, I mean, these are scaling.
Starting point is 00:11:31 We have all sorts of power laws and Moore's laws. Yeah, so the context here, Rich Sutton, very famous AI researcher and reinforcement learning. He kind of wrote this thought piece in 2019. That is, you can't really talk about the future of AI without referencing this. So here we go, because it's actually a very, very good thought piece.
Starting point is 00:11:50 And he basically says that every time there's algorithmic invention, like someone actually thinks through an algorithm, just big data just comes and crushes it. So if you have cheap compute, lots and lots of data and neural networks, then you're just going to win every battle. So we should just give up and the lesson's bitter because we all, of course, value our own expertise, right? So to me, this moment, things are changing a little bit already because they're using RL techniques to improve language models. That's our reinforcement learning. So they're continuing to learn.
Starting point is 00:12:24 There's a lot you could say there. The idea that scale alone, that's what we're really studying. That's what today's moment is. What is scale? How far can scale take you? And I think from a consumer perspective, it can take you very, very far. Very far. right so the point that i make sort of in the afterward of the book is that predicting the future is
Starting point is 00:12:46 always a risky business but i don't think scale alone just transformers and lots and lots of data from the internet is going to get you to any form of consciousness um you know that's just somehow my cat is i would argue is conscious it snuggles it's it's there it's present it's embodied all of these things so i just don't think we're going to get there you know and and you know you know So what's really happened is historically, neuroscience really informed the beginnings of AI, really, truly. And then now we're in this world where AI is taken over. When I was at Google Brain, I was a bet that there was still something to learn from the brain that would inform AI. I have to say, I wasn't not successful.
Starting point is 00:13:32 I'm proud of the work I did there, but I was not successful in that program. Now we're at this point where, like, it's not just true for neuroscience, but for all of science. I think AI applied to those sciences is a major thing that we should be looking at. A lot of people are, right? You know, like my dream job would be like, let's go AI the heck out of neural data and figure out to make huge neural models and then go study how they work. So that to me would be very powerful. And then, you know, just to close it fully, it's my guess that one of the ways out of this scale is the only solution is to actually use AI to study the brain, pull some insights and or data extracts from the brain from doing so,
Starting point is 00:14:13 and then potentially apply those to AI models. The book begins with a wonderful kind of digression from famous physicist, Philip Anderson, and his famous quip that, you know, more is different, which, you know, on the face of it sounds pretty banal and self-evident, but it obviously resonates with you. And then the concept of complexity emerging as a title, indicates, you know, looking back on the highly improbable set of conditions that led you to be, you know, sitting here talking about this wonderful memoir slash science book. And we have to talk about the last
Starting point is 00:14:48 chapter because I do feel like that could be a separate book. But we talk about that later. What do you make of that statement that more is different? I mean, is it is the converse true? You know, less is similar? How do we reconcile these pithy statements with with kind of the deep complexity that I may be hiding in this kind of little three word sentence right when I think of more as different I think of it as a catchphrase for the idea this program that you tell me when I go wrong here because I'm a visitor here
Starting point is 00:15:19 that this program that physicists had that if they just understood the basic interactions of particles then the world would be understood and that was just grossly naive it's just not true at all and it was the second or third time it wasn't true Newtonian kind of clockwork universe ideas two centuries earlier before, you know, Anderson wrote this and the, you know, people used to mock physicists for, you know, having so-called solid state.
Starting point is 00:15:43 They call it squalid state because it was this, you know, dysfunctional aggregation. People trying to take the microscopic and extrapolate to macroscopic. Exactly as you're saying. And now we call it condensed matter physics, which is even worse name in my opinion. But yeah, so go on. So, yeah, this. So the idea, it's very straightforward in like the physical hierarchy of man. matter. So like, just because you know how atoms work doesn't mean you understand chemistry,
Starting point is 00:16:07 just because you know how chemistry works doesn't mean you understand biology. You can keep going, right? In biology of the brain and so on. But biology is a bigger leap, but biology, excuse me, neuroscience to society. That every single one of those levels, you need theories at that level, right? And that's really what he was getting at. And so I really enjoyed that thought piece because it spoke to me about my own experience in the world of if I wanted to study, have a scientific program to understand how I got from A to B, how would I do it? And basically the answer is I couldn't.
Starting point is 00:16:47 And I kind of just have to sit with that answer. The last section of the book, you call it an afterward, which is normally a place, you know, people kind of wax rhapsodically about the contents of the book. You actually go into basically a very technical digression about the applications of the different types of technology and the influence that AI is having on the world.
Starting point is 00:17:07 I wonder, as I said, it's six pages long or something, but I felt like it could be a book in itself. Totally. Did you, maybe you have plans for, you know, follow-up, you know, emergence too, you know, in 3D, I don't know, Ms. Pac-Man could have been in 3-D. What are your thoughts? I mean, the main, you know, kind of takeaway is,
Starting point is 00:17:23 I see you as an optimist. Yes, I am. And it's in contrast. A lot of people we've had on the podcast, Max Tagmark, others, not so much. So what do you make of this kind of battle brewing for AI safety, alignment, all sorts of other things, if you would? So first, to address the questions about the book, it's a braided memoir. That means there's two stories, the story of my life.
Starting point is 00:17:40 And I also tell the story about how neuroscience and some ideas from complexity theory had led to, well, it's really neuroscience, led to AI. Right. And so at the end of this book, there's a conclusion to both stories. And the last chapter of my personal story ends on what happened to myself and all these different kids. And then the afterward is really meant to be the conclusion to the AI neuroscience story. So that's how that works in the book. In terms of, you know, the second part was you're just asking for prognostications on where this is going? Yes.
Starting point is 00:18:14 Yes. I'll be honest with you. I don't know where it's going. I am an optimist. You know, like something happened in the last few months that everyone is really on edge about this stuff. And I think it's probably because the models got better, right? And now this idea of just big data, there's reinforcement learning going on. These models aren't sort of improving based off of these things.
Starting point is 00:18:35 But I think there's, to me, what's going on right now, the zeitgeist of this very moment is that people in positions of power, decision makers, managers, CEOs, they're all just finally catching up with it. So we're having what I believe is really a very human moment, right? And that, first off, these things, they're not robotic. They don't interact with the world as we do. You know, to come here, I had to coordinate with 17 different people. You and I are having a conversation.
Starting point is 00:19:04 There's humanness in everything we do. And, you know, everyone's worried that, especially in software, like, I certainly think software is one of the first things to really going to be impacted by this because so much the infrastructure is ready, automated. So much of software's text and so on and so forth. But like, anyone who's been senior in software engineering knows that. senior software engineers are largely talking to people all the time. Yeah. Design jobs. You're just, I'm having trouble swallowing this like, you know, we're all going to lose our jobs.
Starting point is 00:19:37 I definitely think we should be paying attention. I think everyone should be. Some jobs, yeah, for sure. Yeah, definitely some jobs. But like, rather, I think we should be adapting to these technologies. And I also look at it a little bit like a pointillistic painting, you know, that for a long, long time, programming was like doing one of these paintings point by point by point. And through time, it's gotten better.
Starting point is 00:19:56 We used to program assembler. Then there was C. Then there was Java, whatever. Then there was Python. So we slowly abstracted away the machine. And so I just think that, you know, one view of this is that we're abstracting now to just say, hey, go do this thing or go do that thing. And then figuring out ways to combine those. Let me end with a caveat.
Starting point is 00:20:16 I can't predict the future. So I don't really know. I'm in it just with the rest of us, which I think is really cool and exciting. Yeah, it's a fun place to be. Hey, everybody. I'm usually the one that asks my guest. to judge their books by their covers, but today I'm asking myself to judge my own book by its cover. My newest book, Focus Like a Nobel Prize winner, is chockful of advice, life tips, and focus
Starting point is 00:20:37 and productivity tips from nine of the world's greatest minds. Nobel laureates ranging from economics to peace to physics, of course. So go to Amazon and get the Kindle copy today. Okay, so let's pivot to this force algorithm network. Let's go through it. And then keep in mind, you know, most of the audience, astronomy literate and very scientifically literate. And very scientifically literate, but may not be as neuroscientifically literate. So take us through, what is force, what does it do, how is it involved, and maybe how does it differentiate
Starting point is 00:21:04 itself from things we might be familiar with? Yes, sure. So let me back up to back propagation as the means to train feed forward networks. This was a thing that was developed in the 80s, and because of the slowness of computers, it really never took off, and we had this
Starting point is 00:21:21 so-called AI winter in the 90s. So it turns out, let me just, just one of the punchlines is like, well, we just use back prop these days on big complicated networks, even recurrent neural networks. But when I was doing my PhD, we didn't know that. There's a lot of, like a lot of tricks of the trade that were developed that have slowly just become part of the collective knowledge around neural networks whereby things just work now. Plus we have GPUs. Yeah. But in 2000, say, 2004, 2005, we really wanted to understand how recurrent neural networks functioned. This is again artificial neural networks with recurrent connectivity that make them nonlinear
Starting point is 00:21:56 dynamical systems because we want to understand how brains function. And unless you're working in sensory area, so vision processing, auditory processing, that there the idea is that the sensory processing can really be well approximated through feed forward cascades of information transformation. If you're not in that realm, if you want to think about motor control, if you want to think about executive decision making, then you really need to have recurrent networks. So that then boils down to a task of, well, how do you do it? Okay, we make this little artificial network. And then what, well, how do you train it?
Starting point is 00:22:31 Well, back prop, it's known that back prop doesn't work for RNNs. It turns out it does within limits. But there's proofs about this. There's an unstable eigenvalue no matter what you do. And the Jacobian goes backwards, blah, blah, blah. It just isn't going to work. And so what forced learning was, is my PhD thesis with Larry, was in a attempt. to solve that problem.
Starting point is 00:22:54 And what we did is we built off of some work in the echo state literature. You basically give up on training the internal connections of a network and you just train the outputs. And the idea is you have this very rich non-linear system. So if you drive an input into it, then you get all these harmonics and phase shifts of that.
Starting point is 00:23:12 And so you can reconstruct those through a linear regression to get whatever target you want. We figured out a way to close the loop so that even though you're training the outputs, you're effectively training the outputs, the inputs too. So then you run into problems of stability and you run into these problems in and around chaos and how do you control chaos. And so forced learning turned out to be this really powerful way to solve those problems. The upshot there is that, you know, around 2008, 2009 for the first time,
Starting point is 00:23:39 again, with the caveat that LSTMs were always out there and nobody knew, that was the first time that this community of neuroscientists and computational theoretical neuroscientists could train recurrent neural networks for the purposes of making contact with brain data. And therefore, of course, the goal is to try to understand how brains were. And let's start with monkey brains. At least that was the inspiration. That was with Omer, right? Yeah.
Starting point is 00:24:00 So the idea with Omerie was, so now we could train these networks. But what good is a toy model when you don't understand your toy model? Right. So the tried and true tradition in kind of all of science is you get some data. You study that data through lots of effort. and rigorous analysis, you build a model by hand. You decide what you think is going on in that data. You build the model, you test it on some held out data
Starting point is 00:24:27 of whether or not it explains or predict something. And then you say if the explanations make sense and the predictions work, that's my model. There's a long history of doing that in neuroscience too. And there's been progress doing that too. I would argue that today, the field is in a place where we have these high dimensional electrodes that we record with.
Starting point is 00:24:45 So now we're getting thousands of units it's recorded thousands of neurons. Drowning in data. Yeah, just drowning in data, not really understanding how to make sense of it. And so now really the argument is let's train neural networks to see what they do. That then requires you to understand how those recurrent neural networks function, these artificial networks. And so what had happened is in an effort to try to understand any of this, I took a graduate
Starting point is 00:25:08 nonlinear dynamical systems class. And we studied CAM theorem and we studied, we studied all this stuff. We've studied, but we learned this really basic stuff. Like if you have a simple fix point and a tractor in this landscape, then first off, the activity around there doesn't move. You can linearize around it. And so then the question becomes, well, a linearization, there's no guarantee about how wide in the volume. I just like, hey, can we do this? You're like, would that dumb text?
Starting point is 00:25:34 That's not dumb. Very simple technical methodology work. And so I was asking around. And I was like, no, probably not. And Omen was like, why wouldn't it work? And so we started collaborating on it. And so we developed, you know, when we really drove the technical side of this, some optimization methods to find these fixed points and linearizations in these networks.
Starting point is 00:25:54 And what we discovered, at least for simple cases, and simple gets is a set that grows with every year, you could understand the functioning of a recurrent neural network by virtue of its fixed points and its linearizations. So that compresses the problem and you don't have to worry about the global landscape? I mean, I want to be careful that. You may have to worry about it. But what I think we're discovering is that they're in the world of dynamical systems, and there's really crazy things out there, like the Lorentz Attractor, or like just these iterative maps that have crazy chaotic behavior.
Starting point is 00:26:29 These techniques aren't going to be altogether helpful there. I think computation is a regularizing thing. The fact that this dynamical system does something with computation, it makes it simpler. It makes it regular. I want to be, this is more simple as complicated. Ironically, yeah.
Starting point is 00:26:49 Anyway, so like, like linearity is a good idea until it isn't. Yeah, and then it breaks, right? Yeah, until it isn't. And so if you sort of view it these systems as maybe like switching linear, it's linear over here because, well, you need to add up numbers. And then it's linear over there because you need to do an oscillation, whatever, right? So then it becomes, you've reduced the problem to saying, well, how do I switch?
Starting point is 00:27:10 these systems from being an oscillator to being an adder. And so in that sense, I think we're lucky. And so the complexity is really in the fact that if you have a very powerful computer type system, let's say a calculator, let's say you trained an R&N to be a calculator, right? What would be those structures? Because they would probably be fairly sophisticated in order to handle 12 digits of multiplication, et cetera, et cetera. What I would say is it's not in the in the chaoticity of the system that I'm worried about. It's in just the overwhelming complexity of size. Just you're overwhelmed. The thing I wanted to ask is what you're never supposed to do. Whenever you read a book, you're not supposed to judge it by its cover.
Starting point is 00:27:58 Hey, book lovers, we're judging books by the covers. We know we're not supposed to do it, but it's a the impossible, there's nothing to it. Let's take a look and judge some books. So I want you to take the audience through the title, the subtitle, and the beautiful, really beautiful cover art. The publishing agencies are like, you should really lower your standards on book covers because the process is hard and it's just the iterations and who knows what's going to happen. And the first one they showed me, I was like, that's it. I love it. This is gorgeous. The title is emergence and that's meant to be used in two different ways. One is the story of my life coming up from fairly difficult circumstances. My parents were both drug addicts and ending up in science and having a career neuroscience and AI. And the other is meant to invoke the nonlinear
Starting point is 00:28:48 phenomenon of emergence, the idea that when you combine simple parts together, they create things that could never really be predicted. That has sort of informed how I think about my work. And I thought of it as a nice scaffolding for how to tell a story about somebody who came from really impoverished beginnings to where I am today. Yeah, the title emergence has come to mean a lot of different things. And in particular, your research, which we'll get into, kind of presage this moment that we're in, where neural networks, which were the backwater,
Starting point is 00:29:17 I remember we're about the same age, I'm a couple of years older than you. But I remember in the early 90s, mid-90s in grad school, people working on neural networks, oh, it's nonsense, the brain is on a computer. It's completely different, something different altogether. How does it feel to witness those techniques emerge as indisputably the victor?
Starting point is 00:29:34 I'm not saying it's a good thing, and I'll get into that later, but to just effectively take over, literally seems to be eating the world. I think it's very exciting. You know, for my part, I feel like, you know, I was a contributor to that, but there are lots of contributors
Starting point is 00:29:50 and many much, much with greater impact than me. But for all of that, yeah, I feel like it's an interesting and exciting moment and I'm happy to be alive while it's happening. I want to study these things. I want to study the brain. I want to use these things to study the brain. So I'm excited about it.
Starting point is 00:30:05 How do you react to the explosion of the technology, not just in the research that you and I do, sometimes a backwater compared to what the impacts are on society. Certainly impact of cosmology on society isn't very big. But how is it that these things are influencing? Supposedly people are not needing doctors anymore. Software service companies are going to go out of business. Market cap gets destroyed every other day when Anthropa comes out with some new product.
Starting point is 00:30:30 People are forming synthetic relationships. Some people allegedly have been accused of self-harm perhaps because of the influence of these bodies. How does that impact the research that you do? Or do you just kind of ignore it? Or how does it impact you? The thing that I'm really focused on right now is just broadly the disruption, right? What happens when you have this technology that for all the things that you just said is impacting people's lives in ways that never really happened before?
Starting point is 00:30:59 And so there are a few precedents. I mean, we can look at what happened with the Internet. We can look at what happened with social media and so on and so forth. And I don't want to be a Polyana. I think there are some potentially very terrible outcomes if we're not careful. But I've got to say, you know, if I had a chat bot when I was seven or eight, it would have been amazing. I'm very bullish on these technologies to help educate kids, to help educate people in third world countries and so on and so forth. So I guess that would be my answer.
Starting point is 00:31:28 How when we look at the types of neural networks that you study and you'll hopefully define how they work. But in the presence of noise, you know, the human. and brain is a supercomputer that works in a wet, sloshy environment. You know, some work better than others. But they're basically very simple organs. They can be decomposed into neurons and axons and all sorts of nerve functions. And many other creatures have them, but don't have the ability to create things the way that we do. How do you see that in the context of emergence?
Starting point is 00:31:56 These structures emerging to be the most complicated object in the known universe. I'm an expert in that latter subject, but not the former except for my name is an anagram for brain. Nice. But how is it that the most incredibly complex thing can emerge from something? And maybe take us to a simpler level, perhaps. How does cognition, language instead of pattern recognition, emerge from these RNNs that you study and first define what they are? Yeah, sure.
Starting point is 00:32:20 So I would just push back that I don't think of a brain even as an organ is a simple thing at all. No, I don't think at all. Okay. So what the last, I guess, 7,500-ish years of thinking about brains from a computational point of view really shown us is that like when you abstract away the details of all of the biological detail and you say boil down a neuron which is a very monumentally complex biophysical system get rid of the DNA get rid of the
Starting point is 00:32:49 molecules and all of the ion exchange that's happening to actually propagate electrical activity and just you know say this unit is a simple unit that we're going to call it an artificial neuron which I think many people would be familiar with where it receives these inputs if the sum of those inputs are greater than some threshold value you emit your own output otherwise you don't right so that's the artificial neuron and what the last 75-ish years have shown is that when you connect those things this idea of connectionism that the magic of all of those types of computations that you're talking about can
Starting point is 00:33:23 emerge because those kinds of neural networks can compute anything and so the kind of thing that I've studied is a so-called recurrent neural network oftentimes in neural networks you have basically three varieties now you have the feed-forward network sort of just an assembly system for the information comes in. You transform it, you transform it, you transform it, you get an output. You also have transformers, which are the new hot thing that everyone is building the AI off of.
Starting point is 00:33:45 I'm going to skip that. And then you have recurrent networks, which are relevant really for understanding how brains work, which is that the connections from the neurons feed back other neurons that are connecting to them. So you create loops. And in the process of doing so, you create a dynamical system, dynamical system that can sustain activity that can hold a memory and so on and so forth. And so those are the kinds of the networks that I've focused on.
Starting point is 00:34:11 And what the brain really is is just this enormous high-dimensional non-linear system, at least at the electrical level. When you think about non-linear systems, we think about sensitivity to initial conditions. I want to get into your initial conditions in a moment. But talk about the hallmarks of chaos, of noise, of disorder in the brain or maybe just in the R&Ns. What is the role in that feedback loop of what's the perils of possibly injecting unwanted signals, biases, noise, and that other? Yes, so things can just go off the rails, right?
Starting point is 00:34:39 So unless you have a system that is highly controlled, potentially with feedback loops for control of that system, then things can just go off the rails. That could be true in a very small, low-dimensional, you know, three or four-dimensional system of equations. Certainly could be true in a high-dimensional system like a brain. There are classic results now from the 80s, I'm Sam Polinsky saying that if you have a big enough recurrent system and it's randomly connected, it will either have no activity or it will be chaotic. So you kind of have to contend with that. We don't know all the ways that a brain is doing it, but it's a very deep subject and we should study it. And, you know, for training artificial networks, we have some ideas. When many people, you know, first got exposed to the notion of this nonlinear dependencies, it is in the context of what's called chaos theory.
Starting point is 00:35:25 I remember in the book, Chaos. came out when I was a freshman and you were in high school, you know, I was in college at your bitter rival, Case Western, Go Spartans. That book came out, James Glick, pioneering, you know, kind of monumental book. But one of the things that, you know, really appeals to me about your memoir
Starting point is 00:35:40 is that you show the candid way in candid detail how people that started of basically in identical circumstances came out to be so, so different. That obviously isn't lost upon you. I mean, you spend a good deal into the book writing about it. When you look at that,
Starting point is 00:35:55 what are your kind of, you know, aside from wanting a chat about when you were seven, I mean, what sorts of things could we do to engage, you know, so that people are maybe less sensitive to initial conditions? Or is it irreducible? I have a feeling about my own life, a little bit of like there, but, you know, for the grace of God, you know, would I be in some other place, right? Very potentially bad place, you know. So I follow the outcomes of my best friend in second grade. I follow the outcomes of some of the people I met in the group homes that I lived in, my sister's outcome. Esther, what I do, and we just see where things, what happens to these kids, and the outcomes are all over the place. And some of them are quite sad. In that sense, not being able to understand, you know, why I ended up the way that I am. I can definitely point to things. Don't get me wrong, I can definitely armchair and be like, yeah, well, I think this, that, and the other thing.
Starting point is 00:36:47 And we should do that. But like, when I really sit down and think about it, I have to say, it feels like we're just blown in the wind in the chaotic system. I really, I really, I really mean it. You know, like, I have this sense of gratitude that I, that, that, that, that, that, that didn't happen to me. Let's just take the reader and listener back rather. So you raised basically what we used to call an orphanage. We don't really call it that anymore, right? That's right? But essentially that's what it was. Your parents are both drug, drug addicted, uh, dealing with that as you were from birth, basically, and presumably, maybe even from before birth. And you have an older, had an older sister, tragically, she did pass away.
Starting point is 00:37:22 I mean, it's so improbable. I mean, it is like, talk about playing Pac-Man, you know, playing an arcade game. It's like spawning into, remember the three-dimensional version they tried to make a Pac-Man? It's so improbable that you're sitting here today working meta-reality labs, you know, Stanford adjunct professor, having gone to, you know, gotten your PhD at top, neuroscience program, perhaps in the world, Columbia, and Carnegie Mell and all these other things that are so improbable. But it's kind of depressing if it's like just.
Starting point is 00:37:49 because you're an awesome human being, that's how you did it. Like everyone, if you're as awesome as David, you can do it too. But I always feel like that's kind of satisfying. It's a set message. But maybe it's true. I mean, don't sugarcoat. No. So, yeah. So what I would say is if I look at the things that made a difference in my life, I think many of them are reproducible for their circumstances. I was born very smart. There's just a genetic gift I had no control over. Being smart is a useful thing because you can actually examine the world around you in ways that are beneficial. But, you know, when I was in, when I was in college, there was a guy who threatened to blow up the physics department. And he got kicked out. And he was a pretty bright guy. So there's,
Starting point is 00:38:30 there's clearly more variables at play here. What I would say is that what mattered for me is having a sense of the future. I had an out. Early on, my teachers would make me the tutor of the class or, you know, I was into the gifted and talented program. It became clear to me that there was this thing called intelligence. I had it and maybe there was a way out. We discussed it in the group home with some friends. So I would say if you had some kind of talent that other really because what we're fighting here is neglect, right? We're fighting neglect. And so if you have some kind of talent that, you know, an adult takes a notice of you and then gives you a sense of perspective into the future, hey, you know, if you do this, you can get out of this. I think that
Starting point is 00:39:13 that's very helpful. So it's not, it wasn't central that I was smart is what I'm getting at. I think that people involving themselves in my life. There were multiple people. Some of my aunts and uncles really got involved was really important for me. I look at some of the, however, impoverished the institutions that I grew up in, I mean, it was better than being on the street. I grew up religious. I think that some of the way to, how I structure my behavior now is a function of those
Starting point is 00:39:41 religious experiences. Religious experience has a specific connotation. as a function of those ways of living. Those were all things that were helpful to me. You and I were speaking before we started recording about your religious upbringing. I started some of my thoughts in religion as well. But it is sort of true.
Starting point is 00:39:59 I mean, there aren't many like the Richard Dawkins' orphanage, you know, or there aren't many like militant atheist orphanages and so forth. And it did sort of give you this insight that there's more to life than maybe just science at the early age when you were really interested and computers were coming online. You kind of had this amazing emergence, confluence of events that made you who you are.
Starting point is 00:40:20 But I also don't want to make it only seem like because you're so brilliant that that's how you succeeded. Obviously, the brilliance and your sister was very brilliant, too, as I understand from the book, at least. But then, curiously, at least from my perspective, you seem, you come down quite positively on something that's completely under your control, unlike nature, you know, nurture. And that's in the, you know, cognitive behavioral therapy and the therapy, you know, psychoanalysis that you underwent. Most scientists are not
Starting point is 00:40:46 positively predisposed to psychoanalysis. Popper, you know, was originally kind of developed his scorn for pseudoscience from Freudian psychoanalysis, which you talk about in the book, as kind of the hero, but you also mentioned its shortcomings. So talk about that. How did you reconcile that as a scientific,
Starting point is 00:41:03 you know, I always forget, what are we, right brain, left, I mean, you might be both. How did you reconcile that kind of skepticism that I presume you had with like this more touchy-feely kind I think it's the left brain. Side of things is psychoanalysis. I got out of high school and I went to Carnegie Mellon.
Starting point is 00:41:18 And I had four pretty good years there. And then I left high school. And my life kind of just fell apart. I was living alone for the first time. I think I was working really, really hard in college. And that was like kind of burned me out a little bit. But more importantly, it kept me from like actually taking a moment to take a deep breath and see like, whoa, I had this crazy childhood.
Starting point is 00:41:39 And so when I was in Boston, I lived for a year in Boston. And things just totally fell apart for me in a way that was really, really painful. And so because of that, I started reaching out for help, you know. And so my uncle is a psychoanalyst in New York. And so I asked him for a recommendation. And he gave me one. And I wanted to engage in this process. I was sick.
Starting point is 00:42:00 I needed help and I knew it. It's amazing you had that self-awareness. Well, it was pretty bad shape. Because it could have been much worse. I was borderline ready for a hospital. So coming to your point, I just engaged with the process. And what I would differentiate, and what I think, honestly, a lot of people, a lot of professionals really have two different ideas. One is like the things that Freud said, many of which have been refuted even by that community, versus the process of psychotherapy, right?
Starting point is 00:42:25 Just lots of new theory about what that is, right? And so what I would say is that it's just a very complicated process. So first off, there can be almost no doubt that this really helped me, right? There is no doubt in my mind. How do I reconcile that? I just think of it as trying to make, can science really make sense of such a complicated mode of interaction? If we can't understand what like a single Prozac molecule does when we can actually see what
Starting point is 00:42:49 that molecule is, how am I supposed to make sense in a scientific sense when you and I are talking right now and the sort of impacts that this conversation will leave with us on each other afterward, we're done. From a scientific perspective, that feels like quasars in another universe. I don't even know. It feels like really hard. And so because of that, I just accept that. You know, I know sort of empirically that this is worth for me.
Starting point is 00:43:14 You've written the book, and it's a beautifully written book. It's multiple books. It's not just one, you know, three. It's really two books braided together. Yeah, it really is. And it's, to do that. And I haven't read this book, but I used to have a sponsor called Short Form. And I got their summary of a book called Educated by Tara.
Starting point is 00:43:34 Westover. Westover, yeah. who's also raised, like, militant young Earth creationist and, like, they didn't take aspirin and things. But there's not much for my audience and say that book. I mean, I found the short form summary, perfectly, you know, adept at summarizing when I needed to know. But it's kind of bold to write a memoir, you know, for your first book, especially as an accomplished scientist and researcher and working in industry and business. I don't talk about that, too, that transition. What about the story you felt needed to be told?
Starting point is 00:44:01 And what sort of is the massive takeaway? I'll tell you what I feel. I feel like it's tragic what happens in foster care. And in orphanages, to the extent that they exist. I mean, as I said, I give a lot of credit. I'm not, you know, I was raised Catholic, but I'm not practicing Catholic. I'm practicing Jew now. But they do such a better job than any other organization on Earth, the Catholic Church,
Starting point is 00:44:22 taking care of young children. And yet, it's far for perfect. Siblings get separated. I mean, that's the most tragic. I have three brothers. That's one of the most tragic things. As much as I fight with them, I would not want to be separate from. And you were separated from these people that were your family, and you'd later reunite with them.
Starting point is 00:44:37 And some of them, starting from the exact same initial condition, branched out, none of them are Stanford professors. So what are your takeaway? What did you resolve differently after, you know, writing this book? And how do you expect the reader to react after reading the book? Yeah, sure. So I'll be honest with you. I wrote the book. So I've been in psychotherapy 20, 30 years, right?
Starting point is 00:44:58 Like, I have largely processed this material. I wrote it because I thought it would be a good story. As I wrote it down, that's when I started thinking about, like, well, what are the take-home messages here? And in my opinion, the take-home messages are just shining any light on the kind of experience that I had. I mean, foster care, I'm not an expert in this topic, but as far as I understand things, foster care is largely a failed experiment. What could we do different, you know, or how could we do something differently? It's worthy of having that conversation. I think many people could just sort of understand, like, what were the things that helped me, right?
Starting point is 00:45:35 There are moments of intervention, and why did those things help me? I think that's something that could be understood as well. But tell me, this foreshallengar, and talk about Larry Abbott, who I know from his pioneering work in the Axion, which is not the axon, very different, right? Tell me about working with him and the impact of mentorship, which I also see is a theme in this book. It's a story of how just a chance encounter. You might think it's a tiny thing, but anyone can do it. I always say, I don't know. You don't have kids, do you?
Starting point is 00:46:04 I do not. I've got some if you want to borrow some. But anyone can be a father or mother. You could be a biological father. It could be, but some biological fathers, our fathers weren't great fathers. You reconciled. I reconciled it eventually towards the end of my father's life. But anyone can be an ideological or psychological.
Starting point is 00:46:19 Father, mentor, mother, figure, big brother, big sister. But it's gotten kind of corrupted by Michael Jackson, the Catholic Church. You know, all sorts of other things. But tell me what the role of mentors. And let's talk about Larry and the lab work that you guys did together. I mean, Larry's just an incredible guy. I might literally take a bullet for that guy. Like, I love this man.
Starting point is 00:46:39 Yeah, it shows through in the book. He, of course, the Axion, he did this little study on the Axion way back in like 80, went ignored for three decades. We're trying to detect it now. It's a time. It's 500 citations a year, right? It's just incredible. So good for him, you go, you know.
Starting point is 00:46:57 But he pivoted in his 50s, I guess around 2000 to working on neuroscience. I guess he pivoted a little bit early, actually, in the 90s. But anyways, I came across him in the 2000s when I was educating myself in a PhD. And through just sheer luck, I ended up in his lab. And it turned out to be this fabulous experience for me where he and I really just felt very, there's a simpatico with us in terms of how we thought about things and the way we worked. And he had this simulator. He would actually use the computer to do a lot of his work as opposed to doing just math.
Starting point is 00:47:30 And so I really related to that because it felt like the video games and the programming that I was skilled in. And so we really hit it off there. And what I would say just, just, you know, before we get into the technical stuff, what I would say about working with Larry is like our relationship just grew. And so there was a stability there. Larry didn't come out and said, you know, hey, Dave, I'm going to be a father figure for you, right? It just sort of happened. I don't even know if he would characterize it that way. But like we have a very strong friendship and mentorship at this point.
Starting point is 00:47:57 And it just developed and grew over the years. And so he's been a mentor to me even today. I'll give him a call, you know, three or four times a year. That, I think, is hugely important to kids especially. I don't think you have to take on the full burden. You just have to be this point of stability where a child can pivot around that, can rely on that can build upon it because what really happens in foster care is that you just live with so many people. You are effectively anonymous, right? You just are. So I lived with literally, I lived with
Starting point is 00:48:33 12 different sets of parents from the age of 7 to the age of 18. And nobody even knows what happened to you in the third grade when it will by the time you know, nobody even knows. How are your grades? I guess there's a record of that. Did you get your ass kicked in school? I mean, like just on and on and on. Be down. Yeah, beat down. So in whatever you do, do when you interact being a point showing up is really the thing i would say the single biggest predictor of you know success in a lot of ways is if you had a male role model growing up in in your life and a father figure in particular possibly because women are so much more biologically and sociologically conditioned to be around the kids it's rare that women leave their kids completely but the fathers it's very common
Starting point is 00:49:13 and that's number one predictor of you know the the boys you know next role model is most likely to be a prison guard right i mean that that that that happens in society. Yeah. I just think the barriers to entry need to be lowered. We need to do due diligence, you know, and obviously make sure screenings are done such as a mentor, you know, big brothers, big sisters, whatever, are safe, you know, environments because that's the absolute worst thing that could happen.
Starting point is 00:49:36 But I want to end with a beautiful quote from the book that you made, which is, again, this interwoven memoir, science, nonfiction with a factual life that's unique. There's really nothing else like this. If you liked Tara Westover, if you liked educated, if you like Rob Henderson's book, a recent one about his life in the foster care system, I believe, as well. I haven't read either one of them, but short form, if you're willing to come back on the Into the Impossible podcast, we'll have you back on. But you say beautiful prose, the promise, the commitment was a promise to myself that I would not end up like my parents. It was a vow to escape the chaos to find a different path. It was a declaration of war against the entropy of my own life.
Starting point is 00:50:19 It's really beautiful. Think about how you can use both sides of your brain, both of these hemispheres here, which we separated. I think it's three different had Alice in Miotrion, and we were taking it apart, and he was showing me different things about it. David Cecil, thank you so much for coming down to San Diego, and enjoy the rest of your day.
Starting point is 00:50:35 And everybody, this book is now out by the time the interview comes out, and the audio book is read by David. So you'll enjoy that. Better than the version I made myself read by Snoop Dog. That's another topic. Thanks a lot, David. Thank you for having me. Yeah, bye-bye.
Starting point is 00:50:48 Hey, you made it all the way to the end of this episode, so I know you're going to find fascinating this conversation with Dr. John C. Lennox. One of the greatest and most illuminating mathematicians and thinkers of our time. He's a delight to talk to, and I know you're going to enjoy our warm, loving conversation that we had about the perils and pitfalls of trying to turn AI into our next god. Click here for that. Don't forget to like, comment, and subscribe, and share the episode with a friend. See you next time. I'm Into the Impossible.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.