Daniel and Kelly’s Extraordinary Universe - Can cosmic rays corrupt computers?

Episode Date: February 3, 2026

Daniel and Kelly talk about how computers store information, and how cosmic rays might flip some vital bits.See omnystudio.com/listener for privacy information....

Transcript
Discussion (0)
Starting point is 00:00:00 This is an I-Hart podcast. Guaranteed human. Black history lives in our stories, our culture, and the conversations we still having today. This Black History Month, the podcast, I didn't know. Maybe you didn't either. Digs into the moments, perspectives, and experiences that don't always make the textbook. Let me tell you about Garrett Morgan. Brough had to pretend he didn't even exist just to sell his own invention.
Starting point is 00:00:27 Listen to I didn't know. Maybe you didn't either. from the Black Effect Podcast Network on the I Heart Radio app, Apple Podcast, or simply wherever you get your podcast. When you save that picture of your cute dog or cat on your computer, you expect it to be there later when you want to show it off or send it to your favorite podcaster.
Starting point is 00:00:58 Side note, thanks for all the pet pictures. We love them and keep them coming. But how can you be sure that the information you store in your computer will be there uncorrupted when you come back? for it. Is our digital information infrastructure robust or fragile? Since this is a physics podcast, you might be wondering if there are physics reasons that your data might not be safe. There are. In this case, unfortunately, it's not aliens. I'd love if it were aliens. But the concern is radiation from space. We know that it can hurt our bodies and rewrite our DNA, which are essentially
Starting point is 00:01:37 biological hard drives. Can it also corrupt our computers, cosmic ray bit flips? What does that mean for data centers in space or puppies in space or pictures of cats on Mars? Oh my. Welcome to Daniel and Kelly's extraordinary digital universe. I study parasites and space and if something goes wrong with my computer, I have no clue how to deal with it. Hi, I'm Daniel. I'm a particle physicist, and I'm technically an experimentalist, but really all of my science is on the computer. Now, you told me that you were thinking about going into computer science before you decided to go into particle physics. So are you the person that everyone goes to when they have computer problems? Or now in the era of like MacBooks, is it just like too hard for you to
Starting point is 00:02:41 troubleshoot? Or, yeah, are you the go-to person? I'm definitely the tech support person in my family. Okay. You know, why won't this thing load or I want to print from here? I can't I. For some reason, everybody comes to me, even though, like, computer science undergraduate degrees is mostly about, like, red-black trees and sorting and whatever. And you don't really get expertise in that kind of stuff. Nothing useful. Nothing useful. Exactly. But you have very recently become aware of how fragile computers are, didn't you? Yes, I'm coming to you from Zach's Chromebook because I have within one year broken my computer screen for the second time. And it is in the shop again.
Starting point is 00:03:20 And have you managed to blame this on like weird particles from space? No, no. Just a lot of clumsy members in the Weenersmith household. 50% of the time it was my fault. The other 50% was my adorable children. Well, maybe today's episode is going to give somebody over there an excuse. I didn't break that computer screen. Even if my footprint is on it, it was cosmic rays.
Starting point is 00:03:46 In this case, it was their teeth marks. But yes, maybe they're. They can blame a really amazing combination of particle rays that came in just the right pattern. Well, today's topic really connects with a few of my interest because, of course, I like thinking about cosmic rays. They are particles from space, and everything I do is on the computer. Plus, this is a concept that appears a lot in popular science, cosmic rays, flipping bits on computers. And I thought, let's make sure everybody out there really understands how that works and how serious a problem it is, especially with all this talk of sending data centers into space.
Starting point is 00:04:22 Does it occur a lot in pop science? I can't think of a single example. Wow. Maybe I read the wrong pop science. Or maybe I read the wrong pop science. Can you name five examples? No, I'm just kidding. Just name one.
Starting point is 00:04:41 I'm winning. You put me on the spot, and I cannot name a statement. single one right now off the top of my head. But I have the feeling like this is something that I hear a lot in popular science or that people in general know a little bit about, but might benefit from understanding like the real physics that underlies all this stuff. So that in the future, when there is an onslaught of popular science articles, after we launch the first data centers into space, everyone will be well equipped. Well, but I mean, this is a thing that could be important for our lives personally. So like not only could this be important. If
Starting point is 00:05:17 we put data centers in space, but for example, one of our listeners, Simon, sent us this article where there was a plane that was flying and all of a sudden it dropped really fast and nobody died. Some people did get hurt, but the thought was that some part of the computer on the plane got hit by a cosmic ray. It flipped a bit, which Daniel's going to explain. And that sort of messed up the control of the plane and the plane dropped really quick. And some people got hurt. And so this is like a thing that could impact our lives. And at some point, I have another space-related story that I'll tell.
Starting point is 00:05:48 So, you know, this isn't just a thing that might be important in the future. It's a thing that matters now. Hey, that sounds a lot to me like a popular science article about a cosmic gray bitflip. I thought you said a sci-fi thing. Are you changed? No, popular. You gaslighting me, Daniel. No, play the tape.
Starting point is 00:06:06 We said pop-sci. This is a concept that appears a lot in popular science. Does it occur a lot in pop science? Pop science. Pop science. Popular science. Pop science. Popular science articles. Oh, oh, I thought you said sci-fi. All right. All right, anyway, Daniel and I aren't communicating well today, but that's all right. We're going to move forward. I'll give you the win on that one. All right, so let's lay some groundwork here.
Starting point is 00:06:33 But before we explain how everything works, I was wondering what people knew about how cosmic rays could mess up those pictures of your puppies. So I went out there to ask our volunteers their thoughts without any Googling, of course. whether cosmic rays can corrupt computers. Here's what people had to say. Perhaps the Earth's magnetosphere of protecting us from that issue. Cosmic rays are nasty. Of course cosmic rays can corrupt computer data. Cosmic rays definitely can flip bits in computer systems.
Starting point is 00:07:03 I'm sure cosmic rays can cause problems. I know they can with just drives. I'm not sure. Cosmic rays can absolutely corrupt computer data by flipping essential bits, even when we build in redundancy, I think there have even been stories of car accidents attributed to cosmic ray disruptions. I would think so because the higher frequency spectrum could penetrate, I wouldn't imagine, across the computer barriers.
Starting point is 00:07:32 Cosmic rays and computers do not get along. I think cosmic rays are messing with our computers all the time, and that's why we have checksums and stuff. Yes, I know for certain that in space electronics that are exposed to cosmic rays do have a risk of their bits getting flipped? Absolutely, cosmic rays can corrupt data. Satellites can be corrupted by cosmic rays, so they have to be hardened and redundant. I'm not sure about it on the surface of the Earth, though.
Starting point is 00:07:59 A cosmic ray flipped a bit inside the game as the player was playing and basically cheated for them. They would be able to alter the bits in a data stream, so yes. Great answers, as always. I think when someone says, can X mess up Y? My answer is usually, yeah, probably, like I'm negative. But I don't think I really knew that this was something to worry about until I was researching a city on Mars. And we came across some stories about like space radiation messing up computers. Yeah, people think of their computers as robust.
Starting point is 00:08:36 Like you put a one in memory somewhere. It's going to be there. It's not like some fuzzy piece of paper that can get over. written or destroyed or whatever. But in reality, these are physical systems and they live in the universe and the universe is not always a friendly place to store your data. Yeah, and you're not supposed to bite the screen either. So let's start with understanding how computers store data, because I still find this like slightly magical, which is kind of great. So how are computer storing information? Yeah, it's a great question. And fundamentally, computer store information using quantum
Starting point is 00:09:10 mechanics, right? You could store information on just like a long tape, like the way a Turing machine does. You're like writing ones and zeros. In principle, it can be anything, but you want it to be compact, so you could have like lots and lots of ones and zeros and not have it take up a whole room. And you want it to be fast so you can read it out really quickly and you're not waiting an hour for that picture of your puppies to load. And so we make it small and fast by using atomic systems. And at its core, all these atomic systems are built on semiconductors, which, is why you hear about silicon so much. Silicon is a semiconductor. Well, what does that mean semiconductors? Well, semiconductors sit between insulators, which don't conduct electricity,
Starting point is 00:09:51 and conductors like metals, which conduct a lot of electricity. And to understand why that is, you have to understand something about the atomic structure here. And we're used to thinking about atoms by themselves. You have, like, energy levels, and a silicon atom has, like, where electrons can be. There's like a ladder of them, right? And they can absorb and emit photons, this kind of stuff, there's like a very discrete set of energy levels. And that's true for atoms. And if you have like a gas of silicon, it's going to emit photons at certain energy levels
Starting point is 00:10:19 and absorb at those energy levels. And that's all cool. So that's atomic physics. Is atomic physics another word for chemistry? Because it kind of sounds like it's another word for chemistry. It's the foundation of all chemistry. Yet everything in chemistry comes out of those energy levels. And then it gets more complicated because you have atoms binding with each other
Starting point is 00:10:37 and sharing electrons. And if you take that to it, extreme, you get things like solids where you have a whole lattice of atoms. And now you don't have to solve these problems just for one electron. You have to think about what happens to an electron that's shared among many, many atoms. And so instead of thinking about a lattice of silicon atoms is like each a bunch of individual atoms with their own energy levels, these things overlap because they're so close. And so they spread these sharp atomic energy levels into bands. So we talk about bands of electron energy levels. So you shouldn't think about electron in a piece of silicon
Starting point is 00:11:12 is like, this one belongs to that atom, or belongs to this atom, or belongs to this other atom, it moves smoothly, right? The ions, the nuclei are bound together. They're sort of like fixed in a lattice, and the electrons are all moving around. And there's different energy levels there, which roughly get grouped into two different kinds of bands. There's the valence bands. Those are the low energy ones, where the electrons are mostly local and bound to a specific ion. But then there's above that, there's the conduction band. And these are usually empty, and the electrons can flow around. So if you have, for example, a metal, then there's a very small gap between the valence band and the conduction band.
Starting point is 00:11:51 And the electrons that fill up, the valence band can jump up to the conduction band, like getting on the highway really easily and flowing all around the metal. So if you put an electric field near a conductor, the electrons will jump up to the conduction band and flow all around. If you have a big gap there, like no entrances to your freeway, then the electrons are all stuck in their local area. And even if you put an electric field over it, they're really not going to move. So that's how to understand an insulator and a conductor, or something we call a metal usually. A semiconductor like silicon is something where that gap is sort of like in between. It's not really big. It's not really small.
Starting point is 00:12:29 All that silicon lithography helps you do some fancy local chemistry to make it silicon dioxide or silicon. and nitride that changes its conductivity, which means you can now separate components and make dielectrics and whatever you need to build circuits. And so you can design really, really small electrical circuits just by doping the silicon in various ways. Okay, so the reason you want to use silicon is because it's helpful to have something that's in between because you, well, so like, why wouldn't you just want to have a full insulator or a full conductor of, on your chip, as opposed to just using silicon everywhere and then tinkering with it. Tinkering sounds possibly more complicated than just putting conductors in some places.
Starting point is 00:13:17 Yeah, you could construct it out of just conductors wrapped in insulators, and that's basically how you build circuits on your bench, right? You have like copper wires wrapped in rubber, right? And that works. But it's really hard to do that at a micro scale, to manufacture that stuff, and to do it cheaply and at high volume. It's easier to take silicon, which can be tweaked in either direction. It turns out to be a lot easier to tweak it than to just build it from scratch.
Starting point is 00:13:42 Okay. And so to tweak it, you just like flick some germanium on it and now you're good? Yeah, essentially. We have a whole episode about how silicon lithography works and people should dig into the details there. There's a lot of nuance that's missed in this summary here. But roughly, that's why you want to start with silicon because you can easily change it to be a conductor or an insulator. I'll go check that out. My memory is just the bits lately. Okay, so now we've got our semiconductors. Yeah, and so you can use that to build all sorts of microscopic bits of your circuits out of silicon.
Starting point is 00:14:13 But keep that in mind for later when we're talking about what happens when a cosmic ray tears through a piece of silicon. So now you have the silicon and you want to build a computer. How do you actually use it to store memory? So your computer has several different kinds of memory depending on your need. So the memory you might think about are things like RAM, right? These are like things your computer remembers when it's turned on, but if you turn off the computer, it's empty. So like loading a program into memory so that your computer can run it. Right.
Starting point is 00:14:42 So this is, you know, random access memory. And these days it's called DRAM, dynamic random access memory. And random access just means that you can like get a piece of information from anywhere. You don't have to like read it in order like a book. You can just have an index. You can say, tell me what's stored here. Tell me what's stored there. As long as you know the index.
Starting point is 00:15:02 you can look it up. Okay, so I've got a semiconductor, which allows me to move electrons around. And to build circuits. And so now the question is, how do you use little electrical circuits to store information? Yeah. And so DRAM does this by using a capacitor connected to a transistor. So a capacitor physically, if you build a large one, it's just like two surfaces insulated from each other that are separated. And so you can have like a charge stored across them.
Starting point is 00:15:32 You can, like, put a bunch of electrons on one side, and then you have, like, a voltage across the capacitor. And so capacitor is very useful for all sorts of things in circuits. But in this case, you can just ask, like, well, is there voltage on it? If so, I'm going to call that a one. If there's no voltage across it, I'm going to call that a zero. You just need some sort of microscopic state, which you can control and you can read out. And then you assign the meaning of one to one of the states and zero to the other state. Okay.
Starting point is 00:15:59 But so what could actually be happening in the capacitor is there's a lot of different ways that you could have more charge on one side and less charge on the other, but it all gets sort of summarized down to one and zero. Yeah, exactly. Okay. Because this is digital logic. In the end, everything is analog. Like the universe is mostly analog, but we assign digital meaning to it.
Starting point is 00:16:18 And as these things get smaller and faster than the amount of charge the capacitor holds gets really, really small so that it's quick to read out, it's quick to assign, and it's easy to make small. So in this case, it's like a few tens of thousands. thousands of electrons hold to charge. And you can imagine more complicated systems. If you didn't want to use binary logic, you want to use trinary. You could have like two different thresholds. You could say zero is anywhere from zero to 10,000 electrons. And a one is 10,000 to 20,000 electrons. And anything above 20,000, I'm going to call it two if you have trinary logic. But we use binary. So it's like zero or somewhere above 10,000 electrons or 10 to 30 femtoculums. And so there's a capacitor that
Starting point is 00:17:00 holds it and then a transistor, which allows you access to it. And so that's the basis of DRAM. And this requires constant power. When you turn off the computer, that charge dissipates and it's gone, which is why when you turn off your computer, you don't remember anymore what was in its memory. Okay. So you've got the semiconductor and those electrons are jumping up into the highway and they're getting moved around in the capacitor so that they can store up these ones and these And it's critical there that you have conductors, like the plates on the capacitor made of conductors and insulators between the plates so that the chargers don't just flow across. Oh, okay, all right. So you want them to move and then you want them to stay once you've moved to them. Exactly.
Starting point is 00:17:43 Ah, okay. Exactly. But these things don't work great. Like there's a leakage there because these things are super duper small and the insulator isn't perfect. And so every 64 milliseconds or so, it just like leaks away. So your computer has to constantly refresh your RAM. Every 60 milliseconds or so, it rewrites the number onto it. It's not very stable. That doesn't sound like a great system. I mean, it seems to work great. I love my computer when it's not in the shop. It's kind of amazing that it works as well as it does. Yeah. You don't just like call up a picture and it's like totally corrupted. Yeah. Then there's another kind of memory, which is in your CPU. So your computer basically has like a hard drive to store information.
Starting point is 00:18:23 And then there's the CPU, the central processing unit that does all the actual calculations. Then it has access to the main memory that we just talked about the RAM. But when it's doing the calculations, it slurps stuff from the RAM into a special super fast memory called the cache, which is close to the CPU, so that it doesn't have to go all the way to memory, which turns out to be a little bit slow. So you have these like hierarchies of memory in your computer, which are smaller and faster or bigger and slower. So S-Ram is the fastest, smallest memory. It's like right there next to the CPU. It's like if you're trying to do your household budget, you don't keep all of the information right in front of you. You have like a sheet of paper or a spreadsheet with like the critical details right in front of you that you're calculating right now.
Starting point is 00:19:05 The CPU's cache is essentially like its little worksheet. And SRAM or SRAM stands for, is that slow random access memory? Is that what it stands for? It stands for static random access memory. Okay. As opposed to dynamic, which is the kind you have for most of your memory. Got it. And this is cool because it's built by a pair of logic gates usually.
Starting point is 00:19:30 So it's built using not gates. So not gates are something where if you take in a one, you output a zero. If you take in a zero, you output a one. So it's like a basic logic gate in the same category as like and gates, you know, that require two ones to output a one or an ore gate that requires either of the inputs to be one to output a one. Oh, yes, of course. And these again are built on top of transistor. which are made of silicon.
Starting point is 00:19:57 So you can tie a few transistors together to make a NAND gate or an Nod gate or a knot gate. All these basic digital logics are built on top of the same fundamental technology which are silicon transistors, which have these various levels of doping so they can do what they need to do. And so the way S-RAM works
Starting point is 00:20:13 is you have two knock gates looped on each other. It's kind of silly, but it works really amazingly well. Like, say, have two knock gates, if the first one has an output value of one and then you feed that into the second, one, the second one's going to have an output value of zero, right? Feed the output of the second one back into the first one, and it's going to keep the first one having an output value of one, because it's input with zero. And so it's just sort of like, if you have two of these things
Starting point is 00:20:39 in series and then looped on each other, they support each other. And there's two stable states here. Either the first one is one and the second one is zero, or the first one is zero and the second one is one. And either way, they just constantly support each other, confirming each other. Okay, I'm following. But I feel like the big picture here for anyone who is maybe having a little trouble following is that in all of these cases, what's happening is we are very carefully keeping track of where electrons are going. And they need to be going in particular places and stay in there. And so you have your DRAM, which is a capacitor and a transistor.
Starting point is 00:21:14 You have your S-RAM, which has this pair of knotgates, which are keeping your information. And these are more robust than your DRAM because they don't leak the same way. But these transistors are super duper small, and so like a little bit of tweaking can corrupt them. And then the last kind of memory on your computer is the hard drive. This is the most robust thing where if you turn it off, it will stay there. You can put your picture of your puppy on your hard drive and turn your computer off and on, it's still there. And this is also the slowest kind of memory and the biggest. And these days, this uses flash technology, which essentially like an insulated box,
Starting point is 00:21:49 and you try to trap some electrons in there, or you let the electrons out. And it's insulated so the electrons can't leave. And essentially, you say if the electrons are trapped in there, then it's going to be a one. If the electrons are not trapped, it's going to be a zero. And so this is like a really big energy barrier that's very robust and it's very hard to perturb. But it's kind of slow, which is why you use it for your biggest, slowest memory like your hard drive. Does your computer start with all the electrons that it needs and just move them around? Or does it pull electrons out of the environment when it needs them?
Starting point is 00:22:20 Why are you laughing? Oh, great question. No, I love this question because it touches on like, you know, electrons versus electricity, right? And mostly electricity is the motion of electrons. So you're not like running out of electrons in a sense. But people confuse these two things and they like to talk about electrons when they mean electricity. And remember that these numbers of electrons are super duper tiny, right? Like the number of electrons in silicon is like, you know, 10 to the 29 per gram or some crazy numbers.
Starting point is 00:22:52 So if you're talking about circuits that need like 10,000 electrons, it's really just you're never going to run out of electrons. So electrons are everywhere. We're all just living in a vast ocean of electrons. It's about controlling where they are and where they aren't and where they go. Okay. You don't need to recharge your electrons. But it sounds to me like a really fun scam. We can sell people like, let's top up your electrons.
Starting point is 00:23:15 All right. Well, Daniel and I are going to think of how we can monetize electrons. See if you can hear our commercial for it during the break. And we'll be back in a moment. We're back. Daniel just gave us a great explanation for how computers store information. And now we're going to hear about
Starting point is 00:23:50 how cosmic rays can mess that all up. So Daniel, what is a cosmic ray? I'm sure physicists have this absolutely nailed down. Cosmic rays are basically just particles from space, right? Rays is just like a generic name for a particle or sort of an old-fashioned name. And cosmic just means it comes from space somewhere. And the Earth is constantly being hit by particles from space because space is not empty. You might think of it as like a grand vacuum, but really it's filled with particles, just much lower density than here on Earth. So in our solar system,
Starting point is 00:24:26 for example, there's a massive producer of cosmic rays in the sun. It pumps out protons and electrons and photons and all sorts of stuff. And when the protons and electrons hit the earth, we consider those cosmic rays. And Jupiter also puts out a lot of radiation, contributes to cosmic rays. The center of our galaxy is a massive source of radiation because there's a lot of crazy stuff going on in there. It's hot. It's dense. It's nasty. Things around a black hole also emit cosmic rays because there's a lot of energy there. And so basically a lot of things in the universe are emitting cosmic rays. And they go from fairly low energy. all the way up to like the craziest highest energy things ever seen in the universe.
Starting point is 00:25:07 Which are what? Which are created by things we don't know. So for example, the large Hedron Collider, which is like the pinnacle of energy, the humanity's ever created, we accelerate things to have like 13 terra electron volts. It's 13 trillion electron volts of energy. But out in space, we see things that have a million times as much energy. Whoa. Yeah. Wow.
Starting point is 00:25:28 And we don't know what causes that. Like we know that the center of the galaxy emits some, but not that high energy. We know supernovas create some, but not that high energy. We know if you take particles from supernovas, whizz them around a black hole, that gives them a little bit more energy, but still not that high energy. So we see these particles from space that have crazy high energy and nothing we know in the universe can make particles that high energy. It's like a huge mystery.
Starting point is 00:25:55 Maybe it's a glitch in the simulation. Maybe it's alien particle physicists shooting their beams. at us. Maybe there's information there. We don't know. Okay, go ahead and mark that spot on your DKEU bingo cards, friends. I knew that was coming. Aliens, yeah. Yep. And so they impact life on Earth here because they hit us, but they hit the atmosphere. And the atmosphere is a great shield from cosmic rays, but it doesn't just like delete them. What happens when a particle hits the top of the atmosphere is it collides into the particles in the atmosphere. And so you go from having one very energetic particle to two that have half as much energy, to four with a quarter as much energy,
Starting point is 00:26:33 to a trillions of particles, each with a small slice of the energy, but still enough to make it all the way down to the surface of the Earth. And so we have a constant flux of cosmic rays on the surface of the Earth. It's like 150 muons per meter squared per second at sea level. So like every fingernail you have has hundreds of muons passing through it every second. They're everywhere. So you know what I'm wondering. Do I need to worry about that? Well, it contributes to, you know, changes in your DNA because cosmic rays carry energy and if they hit your DNA, they can change essentially the information stored there.
Starting point is 00:27:14 In great analogy to what happens when a cosmic gray hits a computer, right? This is how biology stores information, but it's not totally robust. You know, I'm not an expert in biology, but my understanding is that cosmic gray can either hit the DNA directly, or it can ionize water, which makes free radicals that react with the DNA. And in the end, you can have something which was originally like a cytosine and now behaves like thiamine and that gets read out differently by your molecular machinery. Now you're building a different protein and like, oops, now you have cancer. Or, oops, now you have like laser rays out of your eyeballs, right?
Starting point is 00:27:48 All sorts of mutations. And so I think this is a big part of our evolutionary history that cosmic rays contribute to changes in our DNA. At least it's my favorite explanation for how me and Katrina have an athletic sun. Wait, wait. When I got skin cancer, you mean I could have gotten lasers out of my eyes? Because that would have been way cooler. Anyway, sunscreen and broad-rimmed hats are your friends, friends.
Starting point is 00:28:13 Yes, exactly. And in that case, I think mostly skin cancer is caused by ultraviolet radiation and not by more deeply penetrating muons or neutrinos. But it's the same principle, you know, particles, in that case, photons, space corrupting the information. And so think of these things as just high energy particles from space that come down through the atmosphere. Okay. So you have these particles from space that carry a lot of energy. Mostly it's muons. They're also electrons in there. They don't penetrate as deeply. Mostly electrons are blocked by stuff like skin or, you know, like a small sheet of metal.
Starting point is 00:28:47 Muons have more mass, and so they don't interact as much. And so they can go deeper into an object. They're also neutrons. Neutrons are created in these showers. And because they're neutral, they pass through a lot of material. They're penetrating as well. Neutrons are less common. They're like 100 times less dense than muons, but they have a lot more mass to them. So they can really do a lot more damage. So you have these particles from space, and we told you that we build all of our digital
Starting point is 00:29:13 infrastructure out of semiconductors, which rely on, like, the delicate balance between a conductor and an insulator based on how exactly the electron bands are arranged. And you essentially build everything out of these building blocks. Well, what happens when a cosmic ray passes through a lattice of silicon? It can smash right into one of those atoms breaking up the nucleus, or it can push that atom, or it can just ionize a bunch of silicon. Or one of the silicon atoms that it kicked off can, like, flow through the lattice ionizing it. And so essentially, if you build your whole digital infrastructure out of this stuff,
Starting point is 00:29:50 it's like taking a Sharpie and just like scrolling right across the page. Okay, so it's not like it's, so I'm thinking about it as a highway now with cars moving along it, the cars being the electrons and the silicon being the highway. It's not that the cars are getting moved around. It's that you're essentially like blowing up the highway. And now the cars like they don't know where to go anymore. Yeah, essentially you're blowing up the highway. And think about, for example, your capacitor, right, your basic unit of memory in your computer. What's going to happen if you suddenly deposit a whole bunch of charge or you. you like tear through that and destroy the insulator so the charge leaks out. You're going to go from a zero to a one or you're going to go from a one to a zero, right? You've built this careful thing out of these bricks and then you have something to just tear right through it. And you're still have the machinery around it that's going to try to interpret what's going on as a one or a zero, right? And your transistor that's next to the conductor is going to be like, oh, excuse me, by the way, this doesn't seem to be like it's behaving itself. It's just going to be like, is there charge zero? Is there no charge? One. That sounds bad. Yeah, and the same is true for your S-RAM, for example. This was built out of knot gates, and these, again, are built out of transistors. And transistors have various components of silicon that are dope in various ways so they can do their logic. If you have a cosmic gray plow through that, it's going to deposit a bunch of charge.
Starting point is 00:31:12 It's going to change the way those gates operate in the same way. Now, hard drives, which are the most robust and the most difficult to change, they have that, like, insulated box. So this requires like stronger radiation, but essentially you want to have like charge in there or not charge in there. If you have a cosmic gray that comes in and tears through it and essentially deposits a bunch of charge, you're going to go from a zero to a one. Or if it breaks the containment and lets the charge out, then you're going to go from a one to a zero. So it's possible for every one of these components in your computer to be susceptible to cosmic rays very similar to how it happens with DNA. Okay. So that all sounds bad, but can we get a little bit more concrete about, like, does that mean your computer's just going to, like, 100% crash every time this happens? Or, like, what's the range of outcomes here?
Starting point is 00:32:02 So it depends on where it hits, right? It might hit your computer. And it just hits a place where, like, nothing is being used anyway. Like, you have an empty part of your hard drive. And it goes from zero to one, whatever, it was noise. It doesn't really matter. Or maybe you were about to store a picture of your puppy there. And so it goes from. from zero to one, and then you overwrite your puppy on top of it, so it doesn't matter, right? So those are the best case scenarios. You've overwritten your garbage with the puppy. Oh, good. I thought you were going to say, I overrode my puppy, and I was going to get really upset. No, that's a disaster. Yeah, meal is very cute.
Starting point is 00:32:36 Yeah. So in the sort of next stage of, like, problematic thing, it's like something goes wrong, and it's crucial, and your computer crashes. You know, like, your computer follows some instruction in its memory, and the instruction isn't what it was supposed to be because it got changed by a cosmic ray, and it does this instead of doing that, and it doesn't work, and it gets stuck and boom, your computer has crashed. That's actually a good outcome because you notice it, and it stops you from using it,
Starting point is 00:33:01 and then you, like, restart your computer, and you're fine. So your computer can recover? Yeah, if your computer crashes, you just reboot it, right? Like, it happens all the time, especially if you use Windows. But, oh, ouch. But isn't your silicon chip still, like, toast? Because it got hit? No, no. You can just refresh it. It's like it will change it from zero to one, but it's not like it's destroyed it, right? Usually these things, you know, are robust.
Starting point is 00:33:26 It's about the charge that's stored in that circuit. And so they can, like, release it or write into it accidentally. But you restart all this stuff, you're going to be fine. It's not permanent damage if it happens occasionally. We'll talk later about what happens to electronics inside the Large Hadron Collider and why that needs to be much more radiation hard. But, you know, if it happens once or twice, you're fine. You have like enough silhou in there for these things to still work. Okay. So then what is a bad result? The bad result is when it happens silently.
Starting point is 00:33:53 Like you have stored some data on your computer and a bit gets flipped and you don't notice. And later you read that in, you do some analysis and you're like, maybe you're doing some science or, you know, your puppy's eye color has changed or something. And this is when it's dangerous when it's a silent mistake, right? Or, for example, you're storing voting results from some precinct in Virginia and like, oops, a bit flips, and somebody now has more or fewer votes, right? This is when we really rely on this stuff. So not knowing that it's happened and having it happened silently in a way that doesn't cause any errors, that's the worst case scenario. Okay, this is getting scary. How often
Starting point is 00:34:34 does this happen? So there's bad news and then there's good news. Okay. So bad news is that it's not that rare, you know, because there are cosmic rays everywhere. So you should expect like a few bit flips per year per gigabyte of RAM, right? And I have like 30 gigabytes of RAM on my computer. And so I'm expecting like dozens of times a year, a bit, it's going to be flipped in my RAM. Oh, yikes. Exactly. And so that seems like an issue. But of course, the nerds know this and they have built protection for us. And it sounds like we're just running out there totally exposed. If you have a computer in an environment where you absolutely need it to be robust, then there are clever systems to detect these errors and correct them.
Starting point is 00:35:17 I find myself wondering, was there an era when we had computers, but we didn't know this? And there were like disasters because we didn't know this? Yeah. One of the ways this was originally discovered is that electronics near nuclear testing in the 50s had anomalies. People were like, whoa, what's going on with our electronics? And then they realized, of course, that the radiation from the blast was interfering with the electronics.
Starting point is 00:35:43 It was like in the 70s that they did. detailed physics involved was understood and then like demonstrated in particle beams. And so it was really like in the 80s people understood what's going on here and can we build protection. Oh, wow. Okay. Well, I'm glad we've got protection now. Yeah.
Starting point is 00:35:58 All right. So, wait, tell me how the protection works, please. So the simplest and the most expensive way to protect your data is just to have copies of it, right? Like if you have really sensitive data stored on a hard drive, hard drives used to be much less stable. You could just like lose a drive, like the head dropped on it back when they were magnetic spinning disks. And so we had these things called like raid arrays where you just had like everything was stored in duplicate or in triplicate even. And in triplicate is cool because if one of them gets corrupted, you have two other copies. And so you can vote.
Starting point is 00:36:27 You're like, is this bit supposed to be a zero or a one? And if two of them agree, then, you know, then that's the way you go. I think that's how like the flight systems work on the space shuttle, right? They all vote. There's three independent systems or something like that. And they all vote. So they always have to the degree. So that's the most complicated.
Starting point is 00:36:44 expensive way. And that's quite expensive because now you're like tripling the cost of your memory. If you want to store a gigabyte, you really have to have three gigabytes of memory in there just to make it protected. Yikes. And so since that's so expensive, people came up with cheaper, faster, more clever ways, things like check sums, right? Like if you send a message across the internet and you want to make sure the message arrived uncorrupted, then none of it got like garbled along the way, you can also send a checksum, which is like the output of a little calculation you do on the data. A simple way to think of a check sum is to be like, take the whole data set and treat it like a number, was it even or odd? And that way, if I send you a message and I also send you, by the way, it should have been even, then you can check to make sure it's even.
Starting point is 00:37:27 That doesn't catch all possible errors, like if you flip two bits, for example. But it's one simple way to say, like, were there any mistakes? So in modern computers, they do something which is more sophisticated than just a simple, like, is it even or odd. They have these parity checkers that take, like, groups of. of four data words. A word is like a bunch of bits altogether. And you have parity bits which check how many events there are and how many odds there are. And if you overlap these things in a clever way and you have this voting system, you can have like one parity bit checks three words and another parody bit checks three other words. And so through some clever algorithms and clever math,
Starting point is 00:38:05 you can detect any bit flip in any of those words without having complete duplicates. So this is like 10% of the data cost. You have a gigabyte of memory. You build a gigabyte plus 10% and you have enough data storage to detect if there were any bits flipped in your core gigabyte. But we didn't have this clever and cheap version during the shuttle era, and that's why the shuttle was so expensive. I don't know if the memory cost contributed a lot to the shuttle. And many computer systems have this already. So, for example, your hard drive very likely has error correction on it. Your RAM probably has error correction on it. CPU caches usually do not because this is the S-Ram stuff because it has to be super duper fast. It really will slow down your
Starting point is 00:38:53 computer if you have error correction on that CPU, like the worksheet that the CPU is using. But if you have a really important computer, like you're operating a data center and you need to provide really robust calculations for your consumers or you're running something in space, then you need to have error correction on your caches as well. Although I was reading that often in space applications, they take consumer PCs and they just turn off the CPU cache because they'd rather be robust than fast. And so they're like, this part is not robust in space, so let's just turn it off. All right. Well, let's take a break.
Starting point is 00:39:29 And when we get back, Kelly will tell a fun story about how galactic cosmic rays messed up computers in space. All right, we're back. We're on the International Space Station, and an alarm has just gone off. Samantha Christofareti, who's an Italian astronaut, was amazing at her ability to memorize all of the different alarms on the station. And even though this is the first time she's ever heard this alarm, she immediately knows what it is, and she yells, ammonia leak. And so she and Terry Verts have memorized the protocol for ammonia leaks.
Starting point is 00:40:20 And so what they do is they run over towards the Russian segment. and they close the airlock for the American side. And so what happens if there's an ammonia leak is that the coolant system for keeping the American side of the International Space Station is leaking. So ammonia is toxic. And so they can't stay over there anymore.
Starting point is 00:40:40 So they close off their side so that the toxic stuff can't get out. Then they're supposed to get naked. Then they're supposed to go through to the Russian side and then they're supposed to close the hatch and hope that the Russians have extradited. extra undies. And I saw Daniel look confused. And so Daniel, are you wondering why they closed to the second hatch? Why did they close the second hatch, Kelly? Just to make sure they didn't get
Starting point is 00:41:06 any more ammonia on the other side. But you're probably wondering actually why they were supposed to get naked, right? Yes, exactly. I was having fun imagining it, though. Okay, well, they apparently were not having fun imagining what it would be like because they skipped that step. But they were supposed to take the clothes off because the concern was ammonia might stick to their clothes, and they might end up bringing ammonia over to the Russian side. And the Russians use glycol to cool their side of the International Space Station. So they don't have this toxic stuff. So ammonia sticks to clothes, but not to like skin? I mean, it probably, it could stick to everything, but the goal is to bring as little as possible over to the Russian side. You ever think the engineers
Starting point is 00:41:45 are just like, let's add getting naked to this protocol and see if the astronauts will really do it? Yeah. Well, so I, you know, you've got to imagine that at some point, they were like, do I really want to see my middle-aged colleague naked in zero gravity? And they must have both decided, no. And so they went over to the Russian side, fully dressed. They locked the second airlock. And then they got permission to go back. And so they go back and the alarm goes off again. And they go through the whole process the second time. And yet again, they decide to not take their clothes off. And I believe they ended up saying they decided to not take their clothes off because they couldn't smell ammonia. ammonia has this like distinct rotten
Starting point is 00:42:28 egg smell. And it turned out that they were right. It was a false alarm. Was this the universe trying to get them naked? Is that what was happening? It could be. Could be. But the best guess that folks came up with afterwards for what happened is that galactic cosmic radiation had hit their computers and had inadvertently turned on the ammonia leak alarm. And there wasn't actually an ammonia leak. They never were able to find an ammonia leak. But it, you know, they started doing this emergency protocol and they had to eventually do it twice before they were able to fix their computers. So we're sort of talking about these data centers today. And I guess one thing I want to point out is that even though the International Space Station is to a large extent protected
Starting point is 00:43:08 by Earth's strong magnetosphere, which sends a lot of these charged particles, you know, to the poles and protects the International Space Station and the equipment and people within it, you still get hit sometimes. And so you still have to worry about the equipment. Where are people thinking of putting these data centers? Are they within the protection of the magnetic field or out past it? I don't think they want to go very far. Okay. Because the further you go, the harder it is to access and to repair. And I'm not on record as thinking that any of this is a good idea. And, you know, the cooling is hard. The radiation is tricky because you don't have the benefit of the Earth's atmosphere, even if you still have some of the magnetosphere. And so I think it's a crazy idea.
Starting point is 00:43:46 Pushed by tech bros who think it's going to be fun and sci-fi. And, you know, I don't know about that. We'll have a whole other episode digging into the economics of data centers in space. But yeah, out there in space, there is more radiation. And so you need more protection. And you need, like, shielding for these things, or you need to make extra error correction. Or you need to use technologies that are more radiation hard. It actually, on Earth, sometimes I'm frustrated by the amount of error correction. There are scenarios where it would be better for physics, even if it's worse for computers and for
Starting point is 00:44:21 Amazon or whatever, to have less error correction because all of these things are evidence of cosmic rays. And if you're curious about cosmic rays, where did they hit what's going on? Then one of the biggest challenge is seeing them. The really high energy cosmic rays are really rare. It's like one per square kilometer per century. So you either need a lot of square kilometers or a lot of centuries. And these massive data centers that are being built out are basically huge cosmic gray detectors. And imagine if you could use all of Google and Amazon data centers to observe cosmic rays from space. It would be incredible, right?
Starting point is 00:44:56 I actually looked into this and connected with folks at Google to see if we could use it. But their error correction is so good. They don't even store it. They don't log it and be like, by the way, we flipped a bit here and we corrected it. They just correct it and move on. How hard would it be for them to collect that data? Oh, man, it would require a change in pretty low-level operating system stuff, unfortunately. And, you know, for a while they were going to be willing to share the data with us
Starting point is 00:45:20 if it was like something they already had, they could just ship to us and we could analyze it. And I was so excited, but then it turns out it's not stored. And getting them to do anything just for fundamental physics is frustratingly difficult. Come on, Google. I thought you were trying to not be evil. Or did they, like, do away with that a long time ago? Yeah, I don't know.
Starting point is 00:45:36 I mean, it's not their obligation to fund basic research. Yeah. But it's frustrating to me because it means that these cosmic rays are there and they're actually observed by our technology. And then that information is just thrown away. You know, one person's error is another person's gold, maybe Nobel Prize gold, right? So that's frustrating. But there are other times when we think Cosette Grays could have affected things on Earth.
Starting point is 00:46:02 The voting example I mentioned earlier was not a hypothetical. There was an election in 2003 in Brussels where one candidate had a number of votes that was suspiciously off by 4,096. They're like, hmm, that's a weird number. Like, how did this number just appear in the computers? it's off by this specific power of two, right? For 1996, it's a power of two. So it could be caused by a single bit flip. And, you know, they went through and nobody tinkered with it.
Starting point is 00:46:29 There's no evidence that anybody, like, infected it or hacked it or there was a copy error or anything. They think probably it was a bitflip that caused this weird result in an election in Brussels. And so it can certainly happen, right? It's something we need to be aware of something we need to know about. Did it change the result of the election? No, they figured out. I think they knew how many votes have been cast.
Starting point is 00:46:49 in total. And so they knew that there was an excess and affected it. And I don't think it flipped the answer either way. So it's just sort of weird and makes people suspicious. And, you know, it highlights something else, which is that our whole digital infrastructure is sensitive to what's going on in space. We recently had a really nice cosmic light show because there was an extra amount of solar radiation, which led to northern lights all over the place on Earth. But that's extra solar radiation, and that can come at any time. In the 1850s, there was a really spectacular coronal mass ejection, an eruption of charged particles from the sun, which basically a huge loop of plasma, which heads towards the Earth, which is just high-energy cosmic rays. That's what
Starting point is 00:47:34 plasma is, right? And it can be like billions of tons of materials. Most of these things miss the Earth, because Earth is pretty small and far from the Sun, but when it washes over the Earth, it can devastate our digital infrastructure more than just like corrupting a single bit on an airplane or in a voting machine. This in the 1850s like set telegraph networks on fire. Like, you know, sparks were flying. Yeah, it's crazy. And if that happened today, like our digital infrastructure is not protected from that kind of incursion. You wouldn't be able to like shop for stuff online for days and days while people fixed it. I mean, can you imagine like if something took out all of our satellites? how absolutely poned we would be.
Starting point is 00:48:18 Yeah. Like no credit cards. I wouldn't have Starlink internet. There'd be no DKEU anymore, everyone. Can you imagine? Dot, dot, dot, civilization crumbles. Yes, right. Yep.
Starting point is 00:48:28 I agree. And so in particle physics, we've thought about this a lot because we are operating computers in a very high radiation environment constantly. Like we're creating that radiation. When you smash two particles together, you generate huge numbers of protons and neutrons and muons
Starting point is 00:48:43 all kinds of stuff. And then we have silicon near the collision in order to measure these things to detect these particles in exactly that way. But if you run it for a while, eventually your silicon gets trashed. So every few years, we have to pull the thing out and send in new silicon, basically. And so people are doing stuff like exploring whether we can use diamond instead. You know, we talk about silicon as a semiconductor, but it's not the only semiconductor out there. You can build these chips out of diamond, which makes them more radiation hard, which is very, expensive because there isn't a huge consumer diamond lithography industry the way there is for
Starting point is 00:49:20 silicon. So particle physics tends to make its advances by like piggybacking on huge trillion dollar consumer trends. And that hasn't happened for particle physics. So we don't have, you know, diamond chips yet. But it's something that people are looking at that we might have to do in the future in order to protect our computers from cosmic rays or radiation more generally. Can we take a step back to like, so if we had another Carrington event type of, of thing happen. Is there something we could have done to protect all of our satellites
Starting point is 00:49:50 from an event like that? Like, should we be encasing all of our satellites in diamonds, is what I'm asking. It would make them so glittery and beautiful. That's right. Well, essentially what you need is mass. Like, either you should build the chips themselves
Starting point is 00:50:07 out of diamond to make them radiation hard, or you should just shield them, in which case you need mass. And mass, as you know, is expensive to put in space. And that's why it's so difficult to have things like computing in space because it needs the thing, which is expensive to put. So I think it would look awesome, but I don't think we should bling up all of our satellites. Okay. So for the foreseeable future, we're probably not going to do anything about the fact that if we had a Carrington event type of thing happen again, we would be in a lot of trouble. Yeah, that would be a massive investment in infrastructure. And I don't see anybody doing it, though I think it would be a good idea. But until then, we'd Do need to be aware that cosmic rays are everywhere.
Starting point is 00:50:46 They're streaming through your room right now, and they're streaming through your computer, and sometimes they can flip a bit, a zero to one. They can plow their way through a bunch of silicon and change the way those delicate electronics work. Most of the time, this doesn't affect your computer, or it doesn't in an obvious way, or it's caught by error correction and fixed, but sometimes it isn't. And so next time your computer goes haywire, you could blame it on your kids, or you could blame it on particles from space.
Starting point is 00:51:13 Or the next time you get that super awesome science result, you should question yourself even more that anxiety should keep you up all night because maybe it was a cosmic ray and you were wrong. Maybe when you counted parasites, you're off by one. All right. Well, thanks for taking this cosmic journey with us into how silicon electronics work and how they are vulnerable to particles from space. As always, we are grateful for your shared curiosity.
Starting point is 00:51:42 Thanks, everyone. Have a good one. Daniel and Kelly's Extraordinary Universe is produced by IHeart Radio. We would love to hear from you. We really would. We want to know what questions you have about this extraordinary universe. We want to know your thoughts on recent shows, suggestions for future shows. If you contact us, we will get back to you. We really mean it.
Starting point is 00:52:10 We answer every message. Email us at questions at Daniel and Kelly.org. Or you can find us on social media. We have accounts on X, Instagram, Blue Sky, and on all of those platforms, you can find us at D and K Universe. Don't be shy. Write to us.
Starting point is 00:52:28 Black history lives in our stories, our culture, and the conversations we still having today. This Black History Month, the podcast, I didn't know. Maybe you didn't either. Digs into the moments, perspectives, and experiences that don't always make the textbook. Let me tell you about Garrett, Morgan. Brough had to pretend he didn't even exist just to sell his own invention.
Starting point is 00:52:51 Listen to I didn't know. Maybe you didn't either. From the Black Effect Podcast Network on the IHeart Radio app, Apple Podcast, or simply wherever you get your podcast. This is an IHeart podcast. Guaranteed human.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.