From First Principles - Can Human Neurons Really Play Doom? The Science Behind Wetware (EP. 33)

Episode Date: March 24, 2026

Hosted by Lester Nare and Krishna Choudhary, this episode is a deep dive into one of the strangest science stories of the year: a dish of human neurons allegedly learning to play Doom. We go back to t...he original 2022 DishBrain paper out of Cortical Labs, unpack how biological neurons can be read and written with multi-electrode arrays, and then compare the peer-reviewed Pong result to the much newer Doom claim. The result is a story that is both genuinely impressive and, in places, probably overhyped.SummaryWetware engineering — replacing artificial neurons with real biological neurons plus electronics, and why some people think this could become a new computing paradigm.How DishBrain worked — human stem-cell-derived cortical neurons grown on a multi-electrode array, trained through sensory encoding and a “minimize surprise” feedback loop.Where the Doom story gets messy — the newer system appears to include a reinforcement-learning layer in the loop, raising the key question: are the neurons actually doing the learning?The big idea underneath the hype — even if Doom is overstated, the broader platform is still a remarkable step toward programmable biocomputing.Support the showDonate: FFPod.com/donateFollow: @FFPod on X / Instagram / TikTok / Facebook

Transcript
Discussion (0)
Starting point is 00:00:00 It's peak pollination season, and my business is scaling fast. To keep the nectar flowing, I need a phone plan with top priority data speed. That's why I chose GoogleFi Wireless. My connections stay strong even when the hive is buzzing. Plus, unlimited plans start at $35 a month. Now, that's a deal that doesn't stay. Explore Google Fi Wireless plans today. Plus taxes and government fees.
Starting point is 00:00:24 GoogleFi Wireless is not subject to data traffic deprioritization during times of high network usage. You said this place was steps from the water. We just haven't found the steps yet. How much did we save? Enough. Enough to get lost. Or you could book a stay with Hilton. Welcome to your ocean front room.
Starting point is 00:00:45 Just steps from the water. The Hilton sale is on now. Book on Hilton.com or the Hilton app and save up to 20% to get the stay you expected. When you want savings, not surprises. It matters where you stay. Hilton for the stay. It's called wetware engineering, and it's effectively neurons, human neurons, plus electronics.
Starting point is 00:01:07 So you cut out the middleman. The learning mechanism is literally minimizing surprise. If, on the other hand, it misses, it's going to get four seconds at 20 volts of highly chaotic electrical activity. So now I can very easily ask who is actually doing the learning. Correct. Is it my reinforcement learning network? or is it the actual biological neurons that I have? Doesn't count.
Starting point is 00:01:32 Hello, Internet. This is your captain speaking. Lester Nare, joined as always by my co-host and our resident PhD, Krishna Chowdery. And today we're going to be talking about a topic that sounds like it's something straight out of science fiction, which is, did a dish full of human neurons successfully play Doom the video game?
Starting point is 00:01:57 Now, this is something that has been, hyped in the news over the last two to three weeks with everyone trying to understand, is this real or is this hype? We're going to dive into the origins of how this came about through a research paper in Neuron out of cortical labs in Melbourne, Australia, published in 2022, and look at that paper versus the recent news. And we might even touch on some of the other big fundamentals that made this concept possible. We are going to learn about the science from the ground up today because this is from
Starting point is 00:02:34 first principles. So in our world of AI, we are very much used to artificial neural networks, right? And these are biomimetic AI in the sense that they mimic biology, right? Biological brains became artificial neural networks. The subject of this deep-demeatic, today is about a possible transition from this biomimetic AI to straight up biocomputing. It's called wetware engineering, and it's effectively neurons, human neurons, plus electronics. So you cut out the middleman, you don't even make any artificial neurons. You're using now biological neurons to help you in computing. Specifically, there's an Australian company called Cortical Labs.
Starting point is 00:03:37 They were recently in the news because their wetware computer could play Doom. And it was all hype, right? I've seen it literally everywhere. Yeah. The tech bros are going crazy. Yeah, all the doomsday people are like, should we be doing this? Right. So, you know, I naturally asked how much of this is real and how much of this is hype.
Starting point is 00:03:59 And it turns out there's a lot of both. Okay. And that's what we're going to get into. Okay. Okay. So let's do a little bit of brief background. Let's talk about the biological neuron, the one that is the substrasonic. the one that is the substrate of our own nervous system in our human brain.
Starting point is 00:04:16 A biological neuron is a cell, a biological cell. We see that on the left here. It's got a cell body. It's got dendrites, which are where other neurons come in and give you input. And in the cell body, what you do is you integrate, meaning you sum up all the inputs. And if all of the incoming inputs from other neurons exceeds some kind of threshold, then you fire what's called an action potential, which is, in other words, a spike.
Starting point is 00:04:46 It's effectively saying my neuron went from off to on because all of the other neurons that were talking to me, there were enough. There were enough neurons that were talking to me with enough input that I decided to go from off to on. When I do that, I go downstream, and the action potential goes downstream to other dendrites of other neurons,
Starting point is 00:05:07 and that's how we get a biological neuron. Now, in the artificial neuron, very much similar. You've got incoming neurons and their activity. The GPU of Nvidia sums that up and then fires if it fires like, you know, an output to downstream neurons if that output, if the input exceeds a certain threshold, right? So we've basically taken the biological thing and we've turned it into a very simple math equation. Now, obviously, in that approximation, we're not doing it complete justice because in biological neurons, there's structure, there's proteins, there's all sorts of little things, and it's not really just straight up integrate and fire. Though that's the simplest kind of model that you can create. But it turns out for a lot of the AI that we see, I mean, it's doing pretty well, right?
Starting point is 00:06:01 So even though we sort of have a simplification of this representation of biological neurons in our artificial. neural network systems, even that simple approximation in and of itself has valuable, productive outcomes. We don't have to replicate the whole of the brain in order to get some benefits from the concept. Exactly. And the reason why is because of the neural network, the idea that we can connect a bunch of these neurons into a giant network where everything is connected to one another, not
Starting point is 00:06:38 everything is connected, but you know, you can have layers of connections from one to the other. So in the biological case, again, on the left, you've got one neuron that's connected to the other. So the output of one neuron is connected to the dendrites of the other through something called a synapse. That's the gap in between neurons that let the neurons talk to one another. And in the artificial case, you've got hidden layers of neurons. And from one layer to the next, you've got inputs going in. Again, each neuron is going to integrate, meaning sum up and then fire if it exceeds some threat. So you've got these non-linearities, and then that's going to create the output, which is going to create the next output, which is going to create the next output.
Starting point is 00:07:15 And if you have trillions of parameters and millions and billions of neurons, then you can actually approximate any function you want, including, let's say, thought. And that's how you get large language models and all sorts of very, very cool things. So that's how we've mostly done artificial neural networks, right? Now, how is the brain different from these artificial neural networks? That's something that we should focus on here. Because these guys are trying to use biological neurons. If the artificial neural network is just that good, why would we want to try to use biological neural networks? There is an advantage that cortical labs and other people doing this technology say.
Starting point is 00:07:58 The idea is silicon models, which are the things in Nvidia and things like that, They use the von Neumann architecture of computing. The biological brain is just the biological brain. The von Neumann architecture is the following. Your processor is separate from your memory. Right? The memory is where you store stuff. The processing is where you compute.
Starting point is 00:08:20 Those are two separate things in a conventional computer, right? You've got your, like, RAM, and then you've got your CPU. And over the years, there's a huge gap between how fast you can compute things in your CPU. or your GPU, and how fast you can access the memory where you store stuff. In the brain, both are co-located. Okay. They're the same thing. They're all stored in the synapses, the weights of these things.
Starting point is 00:08:44 The memory itself is stored in the weights, and the way that you process stuff is you go from one neuron to the other in this neural network. But in an artificial neural network, there's that difference, right? So the idea is that perhaps this gives the brain an advantage. Another advantage that the brain has is that it's massively parallel, and it only uses like 12 to 20 watts of power for 100 billion neurons. On the other hand, state-of-the-art GPUs and neural nets use gigawatts of power for computing, right? So there's a clear difference also in the amount of energy needed to create that computing. So the differences between the two.
Starting point is 00:09:28 One is efficiency. We've created a simplified abstraction of what the brain does. and it takes a lot more energy to even do some of the things that the brain does. So it's a highly inefficient as compared to the brain naturally. Artificial intelligence is highly inefficient or highly energy consumptive.
Starting point is 00:09:48 Yes. Also the way in which the brain is structured, it can do significantly larger amounts of parallel processing which enables that energy efficiency in part. Yeah. And also like each neural, each neuron itself is like highly efficient as a computational unit, right? It doesn't require a lot of energy. We think, you know, our brain does consume 20% of the energy of our body, but still, that's not actually a lot when
Starting point is 00:10:18 you think about things, right? So perhaps the idea is we can use this natural biological wetware in place of our von Neumann architecture hardware, the GPUs that take up all that energy. That's the argument, okay? But in order to do that, we, need to be able to read and write to biological neurons. Right. And also where do these biological neurons exist? Yeah. Yeah. Where do they come from? They're usually in our brain, right? So let's get into how we actually read and write to real biological neurons in a dish. This was actually pioneered in 1972 by Thomas G. Pine at Caltech when he developed the first multi-electrod array. Here's the idea. So neurons produce electricity when they talk to one another. That's how they talk.
Starting point is 00:11:02 They use ion channels, which are little holes in their membranes, the boundary between the inside and the outside of the cell. And when those little membrane proteins, those channels open up or close, that facilitates the transfer of ions like potassium ions, chlorine ions, sodium ions, from the inside to the outside of the cell. Ions are charged atoms in some sense. And so when they move around, they're going to create electricity. They're going to create electric fields, and we can sense that if we have an electrode array, which is just a bunch of tips of metal that is connected to some kind of amplifier. And as each neuron produces a little electric field blip, I can sense that with my electrode array, right? I can also supply electricity to then influence the behavior of the neurons near my electrode.
Starting point is 00:11:55 Like if I've got a little metal tip here and I supply electricity, that is going to change the electric field around that metal tip, which is then going to influence the ions and thus the proteins on that neuron that are nearby that metal tip. And so I can now toggle between on and off based on that stimulation. So I've got to read, which is when the neurons fired, they're going to release a little electric field spike that I can read. And then I can write by supplying electricity and that's going to influence the neurons around it. I have a quick clarifying question. When you talked about the spike just now, you're referring to the concept you talked about earlier
Starting point is 00:12:31 of the action potential. Yes. This sort of like surge of electrical signal that then the neuron utilizes to do its next action or its next step. And so when we have the ability, we have the ability
Starting point is 00:12:47 to provide that energy or source material necessary for the neuron to trigger its action potential artificially. It's not only a capability that is not only a capability that is not reproducible artificially. Yes, exactly.
Starting point is 00:13:07 Yeah, it's something that we can provide a signal to whatever metal contact is near that neuron. And then, you know, that's going to create a change in electric field, which is going to then drive ions around that neuron. It's going to drive the proteins around that neuron. And then the neurons will be sensitive to that change in the electric field, right? They're going to think that it's another neuron somewhere that's like doing that. To just make a fine point on it, we can translate into the language that these neurons speak using this kind of methodology.
Starting point is 00:13:39 Yes, exactly. Yes. So now we can read and write? Yes. Okay. So can we do biocomputing? Well, early biocomputing actually started in the late 1990s, early 2000s. There was this guy, Steve M. Potter.
Starting point is 00:13:50 He was at the Georgia Tech, Georgia Institute of Technology. He developed this thing called a hybrid robot, which is a hybrid robot. he used cultured cortical neurons from rats on a multi-electrod array to actually do just to control like a robot. Okay? So again, this is a little plate that has a bunch of silicon electrodes, let's say. It could be something else. I don't actually know what, but today we use a lot of silicon in our electrodes, obviously. And he could grow rat cortical neurons on that electrode, like in a petri dish, and then sense and then right.
Starting point is 00:14:29 You know? So you've got to read in and then write out. You can listen to what's happening and you can talk to it to tell it stuff. Yeah. Yeah. So this was him controlling a robot. In 2004, this was in the news. That was pretty cool. Thomas DeMars cultured 25,000 rat cortical neurons in a petri dish. And that acted as an autopilot for an F-22 flight simulator. This was all over the news as well back then. And so that's pretty cool, right? Like almost 20 years ago, more than 20 years ago, 30 years ago is when we're doing the stuff. So how come now is when we can actually play video games? Okay.
Starting point is 00:15:10 The reason for that is one, we've had a huge improvement in how many electrodes we can fit inside our multi-electrodot array. Back then, we could only fit about 60. Now we can do thousands. So we can sense a lot finer detail. The bandwidth is greater. Yeah, yeah. It's like the density of the electrodes is literally greater. We can make them smaller. We can make them more sensitive. So we can now sense individual neurons rather than global populations. The other thing is that we need to keep these neurons alive, right, in the petri dish. And we've had a tremendous amount of scientific innovation in how to keep neurons in a petri dish alive for an extended period of time. Right? So now with all of that, we can get to, Cortical Labs, which is a new startup that came out of Australia that is doing this stuff, that is trying to make neurons in a pre-tradish play Pong and now most recently play Doom.
Starting point is 00:16:08 Allegedly. Allegedly. Allegedly. We'll see. Right? We'll see. Before that. We are just so grateful and excited for so many of you that continue to join us on this journey of curiosity and discovery. For those of you who are returning, welcome back. We greatly appreciate your attendance at our multi-week now show release schedule. As we mentioned in our previous episodes, we are now testing going from just doing one weekly episode to multiple with the rundown being it's on episode type. So if you're looking at your podcast feed or your Spotify feed and you're like, what's going on?
Starting point is 00:16:47 You're getting more of us more often, which has been in high demand. And if you do like what you hear, whether you've been here with us, for a long time or your new listener. One of the greatest ways that you can support the show is to give it a like, share, comment, a follow on the platform that you happen to discover us on or listen to us on. And if you want to go above and beyond, we do have a donation portal at the website, ffppod.com backslash donate where you can support the show with a recurring or one-time donation.
Starting point is 00:17:19 We are a two-man operation who've set up this unbelievable. production through the ingenuity of the both of us. And it is really a pleasure to be able to talk to you guys about the wonders of science multiple times a week. And we just again really, really, really cannot express how important it is to fight the billionaire algorithm to just engage in any way that you can, even if it's not financially, it's really helpful for us to get to more people so we can talk about the wonders of what the researchers and scientists all over the globe are providing us in terms of the future of humanity. And with that housekeeping note, we can jump back into cortical labs.
Starting point is 00:18:04 All right. So now let's talk about cortical labs. This is the subject. This is a startup out of Melbourne, Australia. I just recently watched the Formula One Grand Prix in Melbourne. Ferrari, no, it didn't. Mercedes is. With other one where Leclercée,
Starting point is 00:18:21 and Hamilton were battling each other? No, that was the Chinese Grand Prix very recently. Anyways, I don't want to talk about it. Cortical Labs, founded in 2019 by Hong Wen Chong. The company gained international attention in 2022 because of a paper that we're going to cover. It was in the journal Neuron, and it was describing this thing called Dish Brain,
Starting point is 00:18:43 which is a system in which in vitro neural cultures, so that's in vitro meaning in a petri dish, learned how to play the game Pong in a closed loop in. environment. Okay. So that is the paper that we are going to be talking about because that is the bedrock upon which their technology is built and it'll help us understand how exactly they did do. Okay. And it's actually very cool, some of the some of the stuff that they've done. Okay. So first of all, you got to get neurons for your wetware engineering. Yes. How do you get neurons? Well, first you
Starting point is 00:19:16 get primary mouse cortical neurons, those are fine to get. You know, mice are very much an animal that we use in biological research. So that's not that hard. Human cortical cells, on the other hand, are hard to come by. So instead of actually, you know, waiting for a patient who has epilepsy, let's say to have a little part of their brain excised and get that kind of cortical neuron, you never know with those kinds of samples like, okay, it's a brain that has epilepsy. that's not a wild type brain. It's not a neurotypical, as we should say, type brain. So instead, what they did was they generated human-induced pluripotent stem cells.
Starting point is 00:19:53 This is something that we cover a lot. It won the Nobel Prize because it's a technology where we can take somatic cells, like stuff from the biopsy of skin. We can reprogram them to become stem cells, which are blank slate cells that can be anything that they want. And then we can give them a cocktail of, chemicals to differentiate them into cortical tissue. So we take skin cells, we make them a blank slate, and then we make them cortical neurons.
Starting point is 00:20:22 I just want to make a quick note that this was, I think this was the Yamunaka. Yes, yeah. And this was really important because prior to that, embryonic stem cell research was highly controversial. And it was the only way to acquire stem cells. And it really unlocked the ability to actually be able to do this type of work by removing the sort of moral ethical conundrum. So now the point being any human cell you can reprogram to something else. Yeah. That wasn't, that's a huge, that's a huge foundational discovery. Exactly. That now, because I know the sort of stem cell piece for some folks, it's like they, that
Starting point is 00:21:01 historical point, because some people still think, oh, aren't we still getting those from embryos? No, yeah. This is just your skin cell becoming a stem cell. It's pretty insane. Now, of course, with the pluripotent stem cells, it's not a complete blank slate, right? The embryonic stem cell is like the gold standard because those things are built to be blank slates, right? So, but this, this comes pretty close for literally 90% of applications. So it's, it's done a huge service for the biomedical industry because for 90% of applications, I don't need to worry about that. That's still a huge unlock. Yes. I just want to be very clear here. So let's talk about the culture that they made. They made 800,000 neurons co-cultured with astrocytes.
Starting point is 00:21:44 This is pretty incredible here. Usually when we make stem cells, right, and we make a culture of cortical tissue, you're going to have all neurons. That actually is not ideal. For certain applications, it is. Like when we were discussing the ALS story, and we wanted to create an ALS mini brain on a Petri dish, we wanted it to be all motor neurons, right? because then that's how we test the drugs and so on and so forth.
Starting point is 00:22:11 In this case, we want like a live computing unit that kind of mimics the human brain, or at least a small chunk of the human brain. And the human brain is not just neurons. The human brain has astrocytes. So if we look over here, what they've been able to do is grow neurons along with astrocytes and all of the supporting cells. Okay? So the blue, that DAPI, those are marking the neurons.
Starting point is 00:22:41 The GFAP and the red, those are marking supporting astrocytes. And those are critical for long-term functioning. Okay. So this is sort of a secret sauce that allows those cultures in that Petri dish to survive for a lot longer. So now they can learn over a longer time scale. And, you know, actually, now we can take neurocomputing series. If you have a professional sports team with these high skill athletes, they're nothing without the training and medical staff that enable them to continue to perform over a longer periods of time given the amount of stress and things that are going on. Exactly.
Starting point is 00:23:20 And so these are all these supporting cells, which are crucial to creating a compute, like, you know, a little piece of computation on that petri dish. It's not just all neurons. You've got to have sort of a mimicked brain environment. And they were very successful at doing that. So now that's the side of the cells. We've created really nice culture with 800,000 neurons. That's a lot of neurons. Right.
Starting point is 00:23:43 Okay. Now let's go into the multi-electrod array. I'm sorry. I just want to interrupt you really briefly. Yeah. So the 800,000 neurons, would that be comparable to saying like the GPUs in your Nvidia cluster as like the corollary in an artificial system? Yeah.
Starting point is 00:23:59 Yeah. It's like the number of, it's the number. of transistors one can think of. Maybe. Maybe. Maybe. It's like, I mean, no, it's not quite because, you know, it's not one transistor to one artificial neuron. But in your artificial neural network, you have, you know, billions and billions
Starting point is 00:24:15 of neurons with trillions of weights in between them. Here you've got 800,000 neurons. It's still small, but it's incredible what they're able to do with it. Understood. You know? But that's the idea. Yes. You know? Okay. So that's our neuron piece. Now let's get into how are we going to sense them?
Starting point is 00:24:30 That's our multi-electrodes, right? Those are the little metal leads that let us read the neurons and then right to the neurons. So here they did a high-density multi-electrod array using CMOS technology, 26,000 platinum electrodes. What you can see over there, all of these little rectangles is an individual electrode. Okay. So that can be individually tuned to have voltage, you know, 5 volts, negative 2 volts, whatever you want it to be.
Starting point is 00:24:56 It can read the electric field around it and it can write an electric field around. it. Okay. So, and you can see the sort of, this is a, I think it's an electron microscope, and you can see the cell bodies and all the dendrites and all of the connections that are just sort of growing through that multi-electrod array. So when I have a connection, like a little axon that's connected from neuron A to B, I can follow that signal with my electrodes. Yes. Very, very cool, right? It looks like a map of L.A. from the sky. Yeah, yeah, it really does. Like, you know, the Spanish are very good at gritting. And that's what this looks like. If you're in Barcelona, it also looks like that, you know? Yes, yes. And out of the 26,000
Starting point is 00:25:39 platinum electrodes, there's only 1024 independent channels that are active. Okay. So they can pick and choose which ones they want to read through, right? So that's, again, a huge thing. And one of the incredible things is that the latency between reading and then making a computation to figure out what to write, right? Because I'm going to read the activity from the neurons. Then I'm I'm going to, I don't know, I guess, like, see what the game state of Pong is or Doom is, and then right back to the neurons and give them feedback. They got it down to five milliseconds. That's such a huge.
Starting point is 00:26:13 Which is quite a small latency. Yes. If you really think about it. And that isn't great. And if we're trying to mimic the brain, we need that latency as close to zero as possible. Exactly. Because for us, our latency is around that. Yeah, right.
Starting point is 00:26:28 Right. One of the common things that we think about is, like, our eyes. are about 60 frames per second is the top level of how many, you know, little pictures it takes every second and relays back to our brain. And so 60 frames, let's just say 50 frames per second because I want to make my math simpler. Yes. Right. So 50 frames per second, that's about 20 milliseconds is the refresh rate of my eye to the brain. Right? And this is going down to five milliseconds. So it's even lower than that. It's, that's, that's the speed. at which neurons talk. The action potential, which is this thing that I was telling you about
Starting point is 00:27:06 when the neuron turns off, it actually immediately turns off right after. It turns on and off. And that action potential takes about one millisecond. So getting it down is mimicking the sort of computational latency that the brain already has and what these neurons in your dish are used to. Yes. Right? So you don't have them like thinking harder than they need to. Then they need to. And which makes it more energy. Yeah. So now let's look at the dish brain protocol. How do we play Pong with our video game? With our DBP. Yes. Okay. So over here we've got in the middle our Petri dish that has on the bottom of it, this multi-electrodite array. These are little tiny electrodes that are going to sense the neuron culture that is on top. The neuron
Starting point is 00:27:51 culture on top is getting bathed with electrolytes, with glucose to keep it alive. They can keep it alive for as long as six months. And as these neurons in my Petri dish turn on and off, My multi-electrod array is going to sense that. It's going to read it in. It's going to sense the gameplay, and it's going to read it out into the neuron. Now, what we don't want to do is completely jumble up our read-in and our read-out. That was literally going to be my... The orchestration problem there is non-trivial.
Starting point is 00:28:21 Right? Yeah, because we've got a bunch of electrodes, and the brain is not really an all read-in, all-read-out, right? There are regions in the brain, like the occipital lobe that has the visual cortex. there's the prefrontal cortex that makes decisions there's the motor cortex that actually outputs my hand when I'm like let's say using a joystick right so there are segregated spots in my brain for where I'm reading in that'll be my visual cortex
Starting point is 00:28:49 and where I'm going and reading out that would be my motor cortex so they decided to create an artificial visual cortex and a motor cortex on the chip itself okay so at the very top On the right hand side, what you can see is the electrode layout schematic. So you've got your rectangular multi-electrod array that has 1024
Starting point is 00:29:12 electrodes that I can actively read out and read in from. On the very top, I'm going to make a rectangular box that's in the upper half, and eight of those electrodes I'm going to use as my, quote, visual cortex. It's really quite a sensory cortex, I should say, because that's what I'm writing into, right? the sensory cortex is getting information. When I play a video game,
Starting point is 00:29:37 the sensory cortex is through my eyes and through, I guess, sound sometimes, getting information about the state of the game. If you talk to call-of-duty players, sound is as equally important with their high-quality headphones to sneak up on you as visual. I'm just being annoying.
Starting point is 00:29:53 I mean, when I was playing Halo, I used to sometimes have to, like, play on mute because I was like 2 a.m., and my parents would like, you're like, what are you doing? So I'm actually quite good at playing Halo on mute, so I don't know about you guys. But anyways, the idea is there's a sensory cortex that is getting information from the brain. So I'm going to just arbitrarily make that top half of my electrode, the sensory cortex,
Starting point is 00:30:17 and that's where I'm going to write information to my neurons. And the idea here is we're segregating it from other section or sections such that it only has one thing to worry. Yeah, exactly. It's kind of like, I mean, I don't want to give too much. agency to these neurons in a dish. Sure. But what I'm saying is that those neurons in that upper half of my petri dish are acting like the sensory neurons.
Starting point is 00:30:41 They're going to sort of understand that, hey, I'm getting information about something. And I think this goes back to the point you brought up earlier, which is proximity creates stronger neural connections. Yes. And so because you're putting them in proximity and having a similar input, the theory is, you know, they will naturally strengthen around that shared idea. Very good. Yeah, exactly.
Starting point is 00:31:04 Those neurons in that region, they're all connected to one another. And so they're going to somehow figure it out. Right. We're going to get into how, but that's the idea. They're going to somehow figure it out. And actually, if we put that back, the bottom half of that Petri dish has those up and down arrows. Yeah.
Starting point is 00:31:20 That's my motor cortex. Okay. Okay. So neurons in the down region in that little block, if those fire, that's going to trigger my Pong paddle, I guess. It's a paddle, right? It's like a digital paddle. I know what you mean.
Starting point is 00:31:36 You know what I mean, right? Like that, it's going to trigger the paddle to go down. And if the neurons in the up region fire, for those electrodes, then it's going to trigger the neurons to go up. And sometimes you will have to calculate a difference between the up and down because, you know, maybe there's more neurons in the down region versus the up. So I also have to normalize based on like how many neurons. are because I don't want to just artificially keep going down if there's randomly more neurons in that
Starting point is 00:32:02 region. So there's a bunch of mathematical tricks you have to do, but at the end of the day, segregated regions. One set of neurons is going to fire if the culture wants to go down. And one set of neurons is going to fire if that petri dish culture is going to want to go up. Yes. Okay? So now we've figured out our reed and our right mechanism. The right is the sensory cortex at the top, eight electrodes, and our read is getting it in. And I think an interesting point about this is that geographical or locational, where it's located and how it's segregated, like form is function or form is following function in this context. Exactly, yeah.
Starting point is 00:32:39 Now, there's one other thing that I want to touch upon, which is how exactly that writing mechanism is being implemented. Right? Because what do we need to give this petri dish of neurons information about? What is all the information we need to give it? Well, we need to give it a state of the game, right? which is where is the paddle and where is the ball, right, in relation to that. Yes. Now, for us, when we play Pong, we've got a very nice visual system.
Starting point is 00:33:05 So we see where the paddle is. We see where the ball is. And we also see how it's moving. And so we can sort of all of our billions of neurons in our brain can like understand spatially what's going on because we understand space because we've had so much pre-training. And we can figure it out. Yeah. This is a neuron and a petri dish.
Starting point is 00:33:25 It does not have eyes. It does not have senses. We are creating the senses. So how are we going to do it? Well, they use something called rate and place coding. This is something that is very commonly observed in the human brain. Okay. Okay.
Starting point is 00:33:41 For example, let's take the example of the human cochlea, which is in our ear. That's how we sense sound. We sense frequency and we sense loudness, right? I've touched upon the cochlea before. The idea what the cochle is there's like this tube, a snail-like tube that is embedded in our ear. And what that tube is, if you were to unravel it, there would be neurons that are very thick at one end
Starting point is 00:34:09 and there would be neurons that are very thin at the other end. This thing gets wrapped up into kind of like a shell shape, right? But if I hear sound of a certain frequency, that is going to excite a neuron along this axis that is at exactly the right thickness, right? If the sound is very low frequency, like a bass sound, that's going to excite the thicker neurons
Starting point is 00:34:33 because they're bigger and the wavelength of the sound is bigger, so it's kind of just like, you know, the resonance is happening. And if the sound is really high, then it's going to excite the smaller neurons in my cochlea. So that's a place coding. That tells me the frequency.
Starting point is 00:34:49 Okay, okay? That's my place because my place along the axis is telling me the frequency of the sound. Yes. The rate coding is how loud is that sound? Oh, there it is. Right? Because if it's very loud, then that neuron is going to fire at a very high frequency. If it's not that loud, it's going to fire at a low frequency. So I'm getting two pieces of information with place and rate. The place is telling me the frequency. The rate is telling me the loudness, right? And this is something that is very common in biology. Where is, the neurons are firing tells us one piece of information and how much the neurons are firing tells me another piece of information. And those are simultaneously present when biological neural networks create computation. And just to continue to bring this back to the artificial comparison, I think what's interesting here is this is different than just being zeros or ones. Yes. Because you actually have two levels of input. You have the place. Which you could, let me not make that direct comparison. But the point is it's not as simple as a binary system.
Starting point is 00:35:56 Yeah. It's more of like of a two-state modulator. Yeah. No, I think you're on to something here. And that the neuron is kind of an analog system, right, in that sense. Now, some people will argue that the neurons are still digital because they're sending out spikes. And if you were to condense time and discretize it into smaller and smaller segments, then in each time window, the neuron is either active or inactive, right?
Starting point is 00:36:22 And the idea of a high rate is just there's more time windows that it's active. Right. So there is a really healthy debate in the neuroscience community about whether to interpret neurons as analog or digital. Okay. And I don't know where I fall on that. Okay. That's fair. But I think it's a fair point that you're making.
Starting point is 00:36:42 Like it's a very important point. Right. Right. Right. It's like that is still up for grabs. Because the answer to that will also then be relevant to like our conversation about LLMs versus other models that we had in the last episode in terms of if, anyway, you get the idea. Yeah, yeah. So to be clear, I just want to reiterate here.
Starting point is 00:37:05 What we're doing is using something that is in place and something that the brain is already used to, which is place and rate coding. In the idea of the cochlea, the place along this axis was the frequency. and the rate at which that neuron fired was the loudness. In this case, how are we actually, if we go back to photo 14, actually, how are we actually going to use that sensory cortex, the eight electrodes that we're trying to write in our information? How are we going to use place and rate coding to inform my petri dish brain about the state of the game?
Starting point is 00:37:43 That's the key, right? What they did was they said something very interesting. The, um, where the ball is or where the, um, sorry, where the paddle is. Yes. Is going to be my place. Mm-hmm. Okay. So if my paddle is lower on the screen, I'm going to fire the electrodes on the left.
Starting point is 00:38:08 I'm going to make the left electrodes give some electricity to my brain. If the paddle is up on the screen, I'm going to give the right electrodes. Okay, so now I've got place. What's the other thing I need to code in? That's the distance to the ball. How close is the ball? How close is it getting? Right?
Starting point is 00:38:24 And so for that, they use frequency. Yeah. They went all the way from 4 hertz to 40 hertz. 4 hertz means it's very far away. 40 hertz means it's getting close. You need to figure out which way to go. Yeah, yeah. Right?
Starting point is 00:38:38 And the idea here is we are basically creating a translation. between what we experience as a visual input where we can do the rate and place just through our visual input. That's right. Into an electrical input such that, but that is regional as a means to create an extra layer of like degree of freedom
Starting point is 00:39:05 in encoding more data in what is a relatively flat kind of system. Yeah, exactly. And we need to be able to, give those neurons in my Petri dish both of these pieces of information, right? And one thing I do want to be clear here is the neurons in this Petri dish are not directly seeing the game. Right. Like we are. This is kind of what I'm trying to...
Starting point is 00:39:24 Right? Polk at. Yes. They're not seeing the game like humans do. Instead, they're sort of seeing the game based on the paddle's point of view in some sense. It's like, where am I if I'm the paddle? And also how far away is the ball? Right.
Starting point is 00:39:38 Those are the things. And the only resolution is only like... like eight pixels in some sense, right? But from that, they're trying to make this Petri dish play Pong. Okay? So the other thing that we need to think about is fine. Now we've got a read and write mechanism. Yes.
Starting point is 00:39:57 Now we need a training mechanism. Right. Right? Yeah. Because now all of a sudden, I mean, we're telling the Petri dish the state of the game and we have some way of reading what the Petri Dish wants to do, which is move the paddle up or move the paddle down. How does the Petri dish know whether it's doing well or not?
Starting point is 00:40:16 Yes. Right? The challenge is actually quite non-trivial because in the brain, how do we get a reward? You know, we feel happiness. Maybe we got a cookie, literally. That is going to fuel dopaminergic neurons in our brain that are going to send signals of dopamine, pleasure, and all this other kind of stuff. So we've got like emotional signals for reward. We've got other crazy signals for reward.
Starting point is 00:40:42 this is a brain and a petri dish. It does not have dopaminergic neurons. Right. It does not have these centers and this complex brain anatomy to have reward. Yes. So how are we going to do that? And this, I think, is actually the coolest part of the paper to me. Okay.
Starting point is 00:40:58 Okay. It's this idea of a closed loop feedback architecture, and the learning mechanism is literally minimizing surprise. Okay. Here's what they're doing. If there is a successful hit, meaning the Pong paddle thing hit the ball. Yeah.
Starting point is 00:41:19 Okay? And we're good to go. What it's going to do is all of the electrodes in my multi-electroder array is going to send out a predictable electrical pulse. It's about 10 volts on every single electrode at 10 hertz. So 10 every second for about 50 milliseconds. Very short, very highly structured. Okay. Okay.
Starting point is 00:41:44 If on the other hand, it misses, it's going to get four seconds at 20 volts of highly chaotic electrical activity. And the idea is brains hate that. Yes. They want to be able to predict the future. Yes. Okay. And when you're giving them highly unpredictable stuff, even this naive network of biological neurons. Yes.
Starting point is 00:42:08 wants to decrease the amount of times it gets that. So if we look at the GIF, I just wanted to show one thing. So here we've got the brain coming in. You see that bump in electrical activity everywhere. And now it's about to miss. Yep. It's going to miss. For four seconds, it's just that there's noise everywhere.
Starting point is 00:42:26 Yes. Okay, we're feeding a bunch of electrical noise. The neurons hate that. Yes. Now the, it's going to go. It's going to actually, oh, I guess it's going to recycle. Yeah. Let's recycle.
Starting point is 00:42:37 So it'll recycle. When it gets there, a bunch of predictable electrical signal. That's all we're doing. Yes. Our reward mechanism is simply, if you got it right, we're going to reward you with a bunch of predictable activity. Yep. 10 hertz, 50 milliseconds. And if you get it wrong, four seconds of random chaotic activity. And I think this dovetails with some sort of historical science research that, been done around the, you know, when we look at how do we get ordered systems with entropy and this idea of everything goes to soup. And the way in which that works is by minimizing surprise. Yes. It's concept. It's like a biological. It is how life fundamentally is able to
Starting point is 00:43:26 instantiate itself. And that we're basically hijacking that natural kind of concept by artificially punishing the wrong decision with chaos which makes the system work harder and it is naturally going to want to not work harder and so that's why it chooses the right decision it's not it doesn't really know that it's the right decision it is just optimizing for the less energy efficient
Starting point is 00:43:57 pathway yes the less surprising the less surprising let's surprise yeah it's really the less surprising outcome right is what it wants right and now let's look at the results. Okay. Okay. With this training paradigm, let's look at the results. The apparent learning is demonstrated in five minutes.
Starting point is 00:44:13 So here what we're looking at is two sets of box plots. The green box plot is what happens from zero to five minutes. Okay. We're looking at the change in rally length, which is like the, you know, the, how many times the rally happens for the pong? It's just like in tennis. Like how many times do I keep the ball alive? And you can see from six to 20 minutes, there's a giant,
Starting point is 00:44:36 change in the size of the rallies from zero to five. So within five minutes, this thing is learning quite well. There's a up change in rally. And that's for the stimulus. For the controls, which is silent and no feedback, there's no change. Yeah. Okay? So we're getting a significant change only when they're actually playing the game and they
Starting point is 00:44:55 have this feedback. Now, if we go to the next photo, what we're also seeing is that human cortical neurons do a lot better than mouse cortical neurons. Okay? Yes. And that, I don't know, that makes me feel good. Yeah. That my neurons are better than a mouse's neurons.
Starting point is 00:45:09 We are better. Right? So, and that kind of makes sense where we've had evolution and our brains arguably are probably better than mouse. And the thing is, there was an improvement in both cases. It's just our improvement had a, how to, how to big, that's a good point. Very good. Yeah.
Starting point is 00:45:23 It's that ours had a much bigger improvement. Yes. Than the mice. That's compared to the mice. Right. So now that we have that, let's actually look at this free energy. principle that you were alluding to a bit earlier. Oh, interesting. Okay.
Starting point is 00:45:38 That's what it's called, the free energy principle. This is, was, it was first pioneered by Carl Fristin. And the idea is, what you were saying, all self-organizing biological systems want to minimize something called the free energy, the variational free energy. In other sense, what we want to do is maximize the information entropy. And we want to minimize the surprise and the prediction error. Okay. what we can look at is how much surprise there is for what is the probability of a certain state.
Starting point is 00:46:10 If the probability of a certain state is very low, there's a lot of surprise, right? And you can quantify that in the same way that we actually quantify entropy in physics, which is the log of the probability of stuff. And effectively what the free energy principle is saying is that biological systems want to maintain homeostasis, meaning like constancy over all the random stuff. And we want to minimize that surprise, right? And what do we do if we want to do that? Well, we've got some internal model of what the world is,
Starting point is 00:46:46 and we are going to tune the parameters of that internal model such that that internal model mimics more closely the world that we live in. That is the idea. It's a stark departure, I should say, and highlight, from traditional reinforcement learning. Okay. Okay? In traditional reinforcement learning, you've got some algorithm,
Starting point is 00:47:09 and that algorithm explicitly gives negative feedback through back propagation for wrong trials. Here, we are letting chaos itself be the negative feedback. Yeah, yeah, yeah. Now, one of the things that's kind of crazy is that the algorithms for reinforcement learning require thousands of epochs of trial and error to update the. these weights through back propagation. Right. Biological systems will inherently self-organize to minimize this free energy, and they'll
Starting point is 00:47:39 do it very, very quickly. They've just got really nice cellular mechanisms to find that sweet spot in their internal model. The energy efficiency piece again. Yes, exactly, right? And so maybe that's why we want to use the wetware in the first place, right? So let's go back to this FEP in a dish, our free energy principle in a dish. effectively what's happening is you've got predictions and then you've got prediction errors.
Starting point is 00:48:05 The predictions are where I want to move my paddle and the prediction error is the chaos that I get when I get it wrong, right? I'm trying to change the connections in my neurons on my petri dish such that I minimize prediction error and that's effectively what these neurons are doing. It's actually pretty incredible when you think about it. We're not giving them a bad and good signal. We're only giving them a chaos versus non-caus signal. Right, right. And that itself, just because of the nature of information
Starting point is 00:48:46 and the physics of information, is enough to steer them like, this is bad and that's good. I think that's, like, quite deep to think about. The universe never chooses violence. Yeah. It always chooses to Billy when given I'm being a little facetious No, but I think I think that's a very good
Starting point is 00:49:04 And that's a very poignant statement here So now let's look at like Some of the hype that they created around it Okay let's look at some of the promo videos And this was for the Pong era This is just 2022 This is we're still on 2020 Okay, just just confirm it
Starting point is 00:49:17 Yeah Because I know people are listening waiting for the doom Yeah, but we need to really understand this In order to get to doom 100% Okay so this is one of their promo videos look at how many this this is doing really well it's already at like rallies of three yeah maybe it'll get to four yeah yep yep is it going to get to four yeah got to four four four rallies right like this is
Starting point is 00:49:39 it's really it's really doing it five it was getting to five yes now let's look at the results from their paper okay okay okay from the results from their paper if we look at the left hand side yes figure b yes that's the average rally length yeah and look at the y axis yeah what is the y axis on it's on one right most of their trials are on one even on their box plot the edge of their trials for human cortical neurons which is the right hand side box plot yep is going to 1.5 right which is that's the i think it's the third quartile usually for box plot so it's like the 75% of data falls within 1.5 so clearly they're like showing a very nice example in their video right it's still statistically significant.
Starting point is 00:50:28 If you look at the human cortical neurons, which is the right box plot in figure B, from 0 to 5 minutes, the average rally length is at 0.5, and now it's bumped all the way to 1. And if you look at the spread of that data, clearly it's highly significant. If you look at all the other metrics,
Starting point is 00:50:46 you can see for human cortical neurons, they do the best compared to mouse cortical neurons and all the other controls. So there is clearly an effect, but the cortical labs team are, startup. And so, you know, as is tradition in startups, not just in Silicon Valley, but around the world, even in Australia, they're going to hype up some of the results. And that's fine. I still think it's a very cool thing that they did. Yes. And we just walk through why it's
Starting point is 00:51:12 incredibly cool. And if you think that AI is using too much energy, you should be all for wetware. Yeah. Yeah. I mean, it maybe has a lot of applications. So now we're going to scale this up. Okay. We're to scale it up to complexity level 1,000 because we are now trying to play Doom. They built the CL1 platform, Cortical Labs, first code deployable biological computer in the world. That's what it looks like. It's a little box that at its center has the Petri dish with all of the biome, like the biological fluids and all the things to keep the neurons active. You've got the multi-electrodot array. and you've got a little screen on the side that shows you the health of the neurons,
Starting point is 00:51:59 the amount of oxygen there is, the amount of CO2 there is, and so on and so forth. So it can regulate temperature, it can manage gas exchange, it can clear metabolic waste because these neurons are alive, so they're going to be creating waste, all of this other stuff, and they can actually survive in that in vitro petri dish for up to six months. That's great. It's a very long time, okay, for a sort of little mini organoid to survive. So how about the software?
Starting point is 00:52:24 This is very cool. So for the Pong story, they had a custom built C++ machine language type thing. Nobody wants to code in C++. I get that you want the base layer of your software to be in C++ because we want very low latency. Yes. Right? And C is an extremely fast language. Python is quite slow.
Starting point is 00:52:48 But at the high level, I want to code in Python. Okay. And then maybe the Python gets transferred into C. And what they did was they created something called a cortical cloud and a Python API. So now you don't need C++. And you've got a cloud computer of a bunch of CL1 platforms that are each, like, you know, little tiny brains on a dish that I can now just like get into my, you know, on my laptop, I can access. It's open, open platform. I mean, you have to like apply and say like this is what I want to do with it.
Starting point is 00:53:23 But this is, I think, very clever from Cortical Labs. Because even IBM, like one of the reasons why Kiskit, which is the quantum computing platform for IBM, like their software platform, that's used a lot because they've created a cloud platform where theorists and like people who want to do testing of quantum algorithms and things like that, they can go into Kiskit. They can apply for time on the IBM quantum computer. They've got like a dilution refrigerator with that IBM quantum computer. and they can like test out their algorithms and like their air correction methodologies and things like that. So democratizing it is a really good way for like a startup or like a small computer lab to be like, hey, we've created the hardware. Why don't you guys go at it? Yes.
Starting point is 00:54:05 Right? It's very matrix-esque. If you remember the scene where Neo-un plugs and he sees all the pods of humans to capture heat for the machines, that's what that effectively wetware server rack, AWS NVIDIA server rack. Exactly. Yeah, it's like an AWS, but for like wetware. For wetware. It's pretty crazy.
Starting point is 00:54:26 An AWS for a brain on a chip. That is so outrageous. Amazon Web Services, for those who don't know, is the infrastructure layer that provides the back end for a large amount of all of the software applications that you use every day. Yeah, exactly. So with that in mind, now that we've got a cloud computing platform, now independent researchers and developers can mess around with it. And this is where we get into shopping. He's an independent developer. Okay.
Starting point is 00:54:54 Who wanted to use the CL1 platform to play Doom Freedom. Classic. Okay? And the, you know, one of the, I don't know why this is a meme. But like, whenever you have like a new piece of computation or something, everyone's like, can it play Doom? Can it play Doom? 100%. Like, like, even the ESA has like made satellites play Doom and things like that.
Starting point is 00:55:16 So I guess it's a meme. and the guys at Cortical Labs knew this was a meme and knew that if they could make their little thing play Doom, it would get a lot of hype and it certainly worked. And I just want to give a shout out to John Carmack who was the inventor of Doom at Id Software
Starting point is 00:55:34 and literally was one of the pioneers that created all the video game. Like that was like the 3D platform like that concept and what they did at the time is what created all. first person shooter style or like all these types of games. So is Doom before
Starting point is 00:55:53 what's the James Bond? James Bond Golden Eye? Yeah. I don't have a chronological knowledge. But I feel like James Bond Golden Eye was also kind of like from what I've never played Doom
Starting point is 00:56:05 but I've seen the videos while I was researching this and it looked kind of like It's similar. Yeah. But it's not Doom. Yeah, it's not Doom I guess. I was a 64 guy.
Starting point is 00:56:14 I played Golden Eye all the time. I love the music. But Doom is. Doom is kind of like this OG. It's its own thing. Yeah. So it is way more complex than Pong. So, you know, one of the things that I, as I was like researching Doom and I was like watching some videos of people playing Doom, you know, it's really, you're transitioning from a 2D linear game like Pong to now it's kind of like this 2.5 dimensional.
Starting point is 00:56:37 I wouldn't really say three dimensional. Yeah. Because it's like first person and like the, it's not really fully three dimensional, I would say. But there's still, it involves complex navigation. threat detection, and all of this other very cool stuff. So it's a way harder problem than up down, how close. Yes, exactly, right? Oh, you're already getting there.
Starting point is 00:57:00 It's like those are the sort of salient things that I need to code in. You know, before in Pong, I needed to code in position of my paddle and where the ball is. Here, in this case, I need to figure out what direction the enemy is and how close they are. Anyways, let's get to the cells. We got 200,000 human pleurports in stem cells. So this is a fourth of what we had in Pong. It's fewer than what we have in Pong. Right?
Starting point is 00:57:27 But because of like the cloud computing nature and the fact that this is all in Python, the guys were able to map the engine to the tissue in just a week. And this is what it looks like. So you've got your chip and it's sensing motor neurons or it's sensing the neurons. And then it's playing Doom. This is crazy. Right? Yeah, yeah, yeah.
Starting point is 00:57:51 It's pretty cool. Yeah, and we're seeing this like, like the real. No, yeah, those are the neurons that are coming in. And then there, the petri dish is sort of controlling which way to point and which way to fire, right? So let's get into how actually they did it. Because in Pong, remember, we've got a sensory cortex, we've got a motor cortex. They're going to do very similar things. In this case, the proximity, we want to,
Starting point is 00:58:19 We want to encode information about the game state, right? So there's proximity, which is how close I am. Yes. And there's direction, which is where I'm trying to fire. They're using place and rate to code that, right? Now, we want to take the damage. So taking damage and dying is going to be the chaos. And if we shoot someone, that's going to be the predictive.
Starting point is 00:58:46 Reward. Okay. Interesting. Because I was wondering how I was going to make. Matt from Pong to Doom. Yeah. And that's how you do it. And for the motor neurons, there's specific firing patterns that are going to be
Starting point is 00:58:56 move forward. There's specific firing patterns that are going to be turned. And there's specific firing patterns for fire a weapon. And the performance, after seven days of training, the agent's kill ratio was roughly twice that of pure chance. Now, I don't know what that means. And this is where it gets kind of annoying because there's no paper, right? So there's no methods.
Starting point is 00:59:16 They just kind of said that. Right. I'm like, I don't really know what that means. Yep. But they're explicitly admitting that the cells played like a beginner who's never seen a computer. So it's not very good. Yeah, yeah. Right?
Starting point is 00:59:28 But it's still, I guess, higher than chance. Yes. So maybe that's something. One of the things that I found kind of interesting was that during the study, the agents actually generally settled for a strategy of survival, which is hiding rather than active combat. And this is something that is well known in reinforcement learning, right? just like go for the local minima, which is I don't want to deal with anything. I don't want any reward whatsoever, even the positive or the negative, because that positive or negative reward comes from a dense, like, chain of brittle action, right? I have to do stuff. I have to do stuff.
Starting point is 01:00:02 And at the very end, I might just get chaos and then it might just be bad. So why don't I just like go to a corner and try to hide? I wish they gave the, uh, the, uh, nuke testing AI, uh, paper people the same wetware to go to the local minimum. Because as we talked in our two episodes ago, all the major model companies, Anthropic, Gemini, Open AI, all used nukes 90% of the time. Yeah.
Starting point is 01:00:27 Which does not sound like what you're talking about. Yeah, yeah, no, this one is just, they just want to... They're like, we're just trying to hide. Yeah, I'm just going to try and hide. So, now we've covered what cortical labs is purporting they did in their YouTube video. This is where I got a lot of that information.
Starting point is 01:00:42 Yes. Now let's ask what's actually... going on. Yeah, yeah. Okay? Because there's a lot less info about the Doom game versus the Pong game. The Pong game had a full ass, you know, peer-reviewed paper and neuron. This one does not.
Starting point is 01:00:57 This one is just like blogs is all I got, okay? Here's my concern. And this is something that I actually read Tommy Blanchard's substack on. So if you guys want to check out his substack, he's got a lot more info on it. Here's a concern. There are eight input channels for Pong, right? But Pong is a very small game. I can reasonably
Starting point is 01:01:20 put all of that information into eight electrodes. How do I encode everything that's going on in Doom with only eight inputs? Each of which can only have a few possible values, like either the electrode is on or off. And the answer is you actually don't. You use a reinforcement learning model.
Starting point is 01:01:37 This is where it gets a little bit fishy. Okay. Okay. Here, this is from their doom neuron. They've got a little, it's kind of like a GitHub where they have a documentation on what they did. This is the architecture of what they're showing is they've got a computation unit and that is talking to their CL1 computer. Yeah, yeah. Okay.
Starting point is 01:01:59 In that computational unit on the left, right hand side, what you can see is there is a PPO policy, which is a reinforcement learning paradigm. Yeah. That is then encoding the. the input of the game that is then going as stimulation to the brain, to my brain in a petri dish. And what I get back is action, and that is getting fed back into my reinforcement learning algorithm. So now I can very easily ask who is actually doing the learning.
Starting point is 01:02:25 Correct. Is it my reinforcement learning network, or is it the actual biological neurons that I have? Doesn't count. It's not pure. It should really count. It's not pure. It's not that far away.
Starting point is 01:02:36 Like reinforcement learning, there's video games that have like, perfected reinforcement learning. Dota 2, OpenAI defeats Dota 2 world champions. So that's not that big of a deal. Correct. So the question is,
Starting point is 01:02:49 was the software decoder, which is that translation layer, was that actually in the learning algorithm or is the actual neuron doing something? Because in the Pong case that we already talked about in earlier... Very direct.
Starting point is 01:03:02 It was direct and it was maybe incremental, but it was demonstrable learning. Yes. And we could very visit say the neurons are doing the competition. The neurons are doing the learning because there's no funny business going on where like I've got a reinforcement learning agent that, okay, fine, so now it's playing a video game.
Starting point is 01:03:22 What's the big deal? It knows all the history of Doom's lore. Exactly. So here's how they address that. They actually did address it in their frequently asked questions section. They said, okay, isn't the decoder doing all of the learning? What they tested it was, how they tested it was through an ablation study. So they've got a metric called a breakthrough rate, which is the amount of episodes that scores above a negative 600 reward threshold. Now, negative 600 is pretty low because negative 1,000 is the floor. I've never played Doom, so maybe somebody can tell me if negative 600 is like a lot or a little. But here's the key metric that they said. They said that if
Starting point is 01:04:01 I were to use the neurons in the loop, I get 27% breakthrough rate. So my score is above that negative 600. If it's complete silence, so no input from the neurons, or if I get put random noise from the neurons, then I get only a 7 to 8% breakthrough. So the neurons are doing something. Again, there's not a lot of trials. They're just spitting out these numbers. It's not, you know, I don't see data. I'm just like taking their word for it that they wrote in a blog post that it's 27% rather than 8%. So from baseline, okay, fine. Baseline is 8%. 27%. Maybe I'll take your word for it. The next problem, that they addressed in their frequently asked questions is that there's a you know is it the encoder is it is it the thing that is encoding the neuron state and then feeding it into the reinforcement learning
Starting point is 01:04:53 thing because that itself is a neural network right right so is that doing all the learning well let's see so if you look at photo 30 this is their response they say that you know well the brain cells in the loop They're effectively saying that assuming the brain cells are static, this is not correct. The policy and the cells are dynamical systems. So the biological neurons have an internal state. And that is affecting the reinforcement learning. Now, to me, all that's saying is like, well, we actually made it harder for the reinforcement learning agent. Yeah.
Starting point is 01:05:24 You know, it's like, okay, the neurons are kind of like a noisy controller that I'm like using to create action to my game. And maybe, you know, to me it's like, okay, well, that just means my reinforcement learning agent is like okay with a noisy controller. Right. The last sentence is what is key. During testing, encoder weights were frozen and still observed improvements in reward, which means that I froze the way that I am reading my neurons. And still there was improvement, which means that the neurons were maybe doing some type of learning. Yep. You know, but there's still the encoder in between, even if it's an inference.
Starting point is 01:06:03 frozen state. And so how do you know that it's not attributed to some base some base capability in the encoder, even if it's frozen? Yes. So that's where I'm like, that's my naive take. That's exactly right, right. You know, it's like in the YouTube video, they talk about, oh, it's like, you know, if the neurons fire a certain way, the game was going to turn left. If the neurons fire a certain way, the game will move the character in the game will move forward. And if they want to fire, the neurons will fire a certain way. Well, not really.
Starting point is 01:06:35 If the neurons fire one way or the other, it might just be that the reinforcement learning agent is just like also interpreting the state of the game and just like overriding everything. So I don't think it's a very clean test is what I'll be honest about, right? And perhaps they're working on a paper that is going to elucidate some of these holes. I certainly hope so because I do want a petriarch. game. Right.
Starting point is 01:07:00 I'm sorry, I do want a petri dish brain to play Doom. I think that would be hell of cool. Yes. But I'm, I think the jury is still out because one, all of this is anecdotal. Yes.
Starting point is 01:07:12 So there's no hard data. They're not showing actual, you know, box plots about the spread and so on and so forth. There was another thing that I read just briefly that was saying that the seed in my neural network actually affects how the gameplay is done, right?
Starting point is 01:07:27 The random seed that initialized. as weights and things like that. It's like, well, if it's that fragile, can you really say something about whether the neurons are doing anything? So I don't know, you know. But that's my skepticism. And this is not to take away from the foundational discovery, that dish brain in the Pong context,
Starting point is 01:07:46 which is why we took the time to talk about how we got to the Doom version, which is like there is a there there. Yes. But the way in which they're trying to scale it up for the increased complexity of the tasks, they're trying to get the wetware to do. It is not clear that it is purely from improvements at the artificial human pluripotent stem cell level that's doing the learning. Exactly. It could just be the infrastructure that has now been built around it.
Starting point is 01:08:17 Exactly. That makes it kind of like an interesting thing. Exactly. But not brains on a chip are doing all of the heavy lifting. Exactly. And that's what all of the news articles. are saying, right? All of the news articles are like, a brain in a petri dish, play Doom.
Starting point is 01:08:34 I don't think anyone's mentioned this reinforcement learning paradigm in the middle. Right. That is kind of the whole game, maybe. Right. And because there's not a paper, again, like the Pong version, which they've previously done, the jury is going to still be out on this. And that's why it's really important to have peer-reviewed journals because a reviewer would be like, what, you know, what are you doing? We're doing that here on the pod, let alone in a professional context. Yeah. Exactly. Yeah. One last thing I
Starting point is 01:08:58 want to touch on is there was a lot of heat that the authors took for their 2022 paper because the 2010-22 paper in their title mentioned sentience learn and exhibit sentience there was another paper that came out that was a backlash okay again published a neuron as a rebuttal and they were effectively saying that you can't use sentience right or the idea of perceiving because the behavior that you're mentioning and you're observing here when the Petri dish plays Pong is one of homeostatic feedback stabilization. There's no high level cognitive intent. Yeah, yeah.
Starting point is 01:09:39 Right. And so you can't really say things like that because then with sentience comes ethical concerns. In the public lexicon, the term sentience means there's some morality now that we have established onto this dish. And if that's the case, well, what are we actually doing? Yes. Yeah, and I think we kind of, over the course of this episode, we've kind of landed on that same point, which is, oh, this is an interesting fundamental understanding of how you can control neural networks through electrical, like electrical read and write architecture to optimize for stability in the system.
Starting point is 01:10:21 That's cool, but that's effectively what it's, and then we're just connecting that to an external thing we are interpreting and giving value to. as like, oh, hit it with the paddle. Yeah. But it fundamentally. Yeah. Fundamentally, it's just trying to reduce surprise. But surprise itself is a mathematical construct that is trying to reduce, right? Just because of the way that the proteins and the neurons work when they get together.
Starting point is 01:10:48 Right. So it's a far leap to say this dish brain is sentient because it can sense and respond to an environment like Pong. Right. Right. But this, again, I don't know. This is again, this is, we're going beyond my pay grade, and I do want to be very clear about that, right? Because questions of consciousness, questions of sentience is something that I grapple with all the time. I have a PhD in like neural systems, right?
Starting point is 01:11:13 Like one of my PhDs was about how neural networks talk to one another and sleep and so on and so forth. You can create very simple neural systems that sleep and mimic behavior. I don't know what consciousness is, but for them to put that in the title is a bit weird. Right. And I think that's what the rebuttal was about. Which is, which I think is fair pushback. Yeah. Not to minimize the work being done and just to be clear, right? Like this is not trying to be like, oh, you didn't do anything.
Starting point is 01:11:40 No, because it's pretty cool. Sometimes the reviewers are like, yeah, you didn't do anything. Oh, yeah. That's not what we're saying. Yeah. It's just that there's a nuance about the really cool thing that is being done. And then the CNN news headline that says, brain on a chip is playing Doom.
Starting point is 01:11:54 Yeah. Which is not quite. accurate. Yeah, exactly. And so, you know, on this podcast, we're in the business of like just talking about what was done. Yes. What were the methods and what were the actual findings, right, and the results. And so, you know, it is what it is. They use a reinforcement learning paradigm. I don't know if that changes things. It seems to me that they have a lot more work to do. Yes. To convince someone like me that it's the neurons that are playing doom. Yes. And not some small artificial neural network. Yes.
Starting point is 01:12:25 There's not a lot of details. So I'm looking forward to hopefully a peer-reviewed scientific paper. You know? Cortical Labs, if you want us to come over to Melbourne. I think it's Melbourne. Yeah, it's Melbourne. Come in and do some coverage
Starting point is 01:12:39 as you prepare for release of a research paper. We are more than happy to join you. Hell yeah. And have the opportunity to learn more. We will send this episode to you to get your feedback as well or corrections. So please let us know if there's anything that we can do to clarify here. We are fit. Like this is super interesting.
Starting point is 01:12:56 Yeah. We are just curious to learn more. And so I'm sure they are already well ahead of us. Yeah. On that point. So we'll see what happens. We will cover. If there's a follow up, we will cover it again.
Starting point is 01:13:06 This was a phenomenal episode around some of these details. I just, we could talk about this forever. However, we said we were going to keep our episode shorter. So I'm going to quickly wrap us up here after that. Let us know in the comments. We have a comment for the community today, for folks to comment on what game do you want to see? Oh, that's a good one.
Starting point is 01:13:29 What game would you like to see the brain on a chip play next? Doom was number one. What's number two? I know what my thought is. Maybe we'll share it next episode. I'm your host, Lesterneri, joined as always by my co-host
Starting point is 01:13:40 and our resident PhD, Krishna Chowdery. Again, multiple episodes a week. We'll see you later this week. Yamava Resort and Casino at San Manuel is California's number one entertainment destination for today's superstars.
Starting point is 01:14:10 Catch the Jonas Brothers return to the Yamava Theater stage on April 30th, the powerful vocals of Demi Lovato on May 17th, and the signature Southern Country Rock of Eric Church on July 19th. Tickets on sale now at Yamavatheater.com, only at Yamava Resort and Casino, celebrating its 40th anniversary. You in? Must be 21 to enter. Relax and let Ralph's delivery handle your grocery shopping this week. We start with only the freshest items, then review your list and carefully choose each one. Then we pack it all up and deliver it in as little as 30 minutes so you can feel confident it's what you ordered. Fresh groceries, your way, with Ralph's delivery and pickup.
Starting point is 01:14:55 And right now, you can save $20 on your first delivery or pickup order. Ralph's, fresh for everyone.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.