Catalyst with Shayle Kann - How AI is changing weather forecasting

Starting point is 00:00:02 Latitude Media covering the new frontiers of the energy transition. I'm Shale Khan, and this is Catalyst. We don't really understand how the AI models forecast it, but they are capable of treating the hurricane as almost like a large macroscopic scale object that is moving. They have like spatial awareness in a way that the old models didn't. Yeah. It's a really interesting area, I would say, of like sort of the science of how AI works, to understand exactly how they see the world in that sense. Coming up, where the winds are blowing, using AI for weather forecasting.

Starting point is 00:00:49 When utilities need flexible capacity they can count on, they turn to Energy Hub. Energy Hub works with more than 170 utilities, coordinating over 2.5 million devices to manage 3.4 gigawatts of flexibility, built for the moments when utilities can't afford uncertainty. Energy Hub builds and operates virtual power plants that utilities actually stake their grid planning on, coordinating EVs, batteries, thermostats, and more through a single platform built for utility scale. Predictive, verifiable, and designed to perform when it counts. Learn more at energy hub.com. Trillions of dollars are flowing into clean and critical infrastructure, but those investments

Starting point is 00:01:27 aren't driven by technology alone. They're shaped by markets, by policy, by capital, and by the institutions that connect them. I'm Alfred Johnson, CEO of Crux, and host of a brand new podcast, Critical Capital Capital. Each episode, I talk with people deploying capital, shaping policy, and building the clean economy. Tune in as we unpack how progress is actually made. Listen to Critical Capital on Spotify, Apple, or wherever you get your podcasts. Catalyst is supported by FishTank PR, an award-winning PR firm focused on climate and energy tech, renewables, and sustainability. FishTink is known for generating prominent and effective media coverage for the brands they work with. If you want a PR partner that's thoughtful, shoots straight, and gets results, you'll like

Starting point is 00:02:10 Fish Tank PR. To learn more about Fish Tank's approach, visit fish tankpr.com. That's F-I-S-C-H-Fish-Tankpr.com. I'm Shale Khan. I lead the early-stage investing practice at Energy Impact Partners. Welcome. All right, so here's a statement that I suspect would be pretty non-controversial. AI will improve weather forecasting.

Starting point is 00:02:36 It's obvious, right? and it seems like it must be true. I certainly would have agreed with that statement had you asked me before this conversation you're about to listen to. But to me, the interesting question is why exactly? Like, through what mechanism can AI improve weather forecasting? For that matter, how do we actually do weather forecasting today? And if it does get better, what are some of the likely outcomes that it will enable?

Starting point is 00:02:59 It's an interesting set of questions for me for two reasons. First, weather forecasting itself is important to a whole host of other categories. I care about. Obviously, resilience, but also energy and a variety of others, agriculture, etc. But also, it's interesting because I think it's exemplary of a whole host of

Starting point is 00:03:18 next wave applications for AI. LLMs are, of course, finding their way through everything that requires language. Now, there are world models starting to show up to try to revolutionize robotics and things in the physical world. What about things like weather, where we have used some machine learning

Starting point is 00:03:35 historically, but can we do better with transformers and the new architecture of AI that we're seeing in other categories. Let's find out. My guest today is Peter Pataglia. He's a senior director at Google Deep Mind, where he is leveraging the big brain inside the deep mind to improve weather forecasting. Here's Peter. Peter, welcome. Thanks. Happy to be here. All right, let's talk weather forecasting. I want to start maybe by having you school me a bit on something that I realize I don't know, which is how do we do weather forecasting, like currently? And maybe a little bit of history.

Starting point is 00:04:11 Like, have there been major shifts technologically and how we forecast weather historically? So maybe walk me through the history, such as it is, of how we forecast weather. And then, like, what do we actually do today? Yeah. So I have to admit, I'm actually a relative newcomer to the area of weather forecasting myself. So we had gotten involved in this several years ago. and it was sort of built out of a research program that was trying to model complex simulations including fluids and the earth's atmosphere is a fluid and one of the big challenges that we were

Starting point is 00:04:45 sort of interested in exploring was modeling the atmospheric fluid which is weather forecasting so I should say that I sort of have gone through this journey of learning about weather forecasting So the stuff that I'll say hopefully it's accurate, but, you know, forgive me if I make mistakes. So I think my understanding about the field is that really a lot of the, you know, historically weather forecasting was very important for agriculture and, you know, sort of other use cases that were very important for kind of day-to-day life. But I think it was about maybe 100, 150 years ago that you had agencies or bureaus that were starting to do like marine forecasting or like kind of. more systematically, collecting observations systematically and treating it as a science. But then probably about 50 years ago or so, you started to see the emergence of like large government public weather agencies.

Starting point is 00:05:38 So I think NOAA in the U.S. was formed in the 70s. I think ECMWF, the European Center for Medium Range Forecasting, was also formed in the 70s. And these are two of the big prominent weather bureaus. But most governments have a weather bureau, and they, it's sort of weather has traditionally been viewed as a public good. So this is something that, you know, they collect tax money and then fund their weather service. And the idea is that a lot of the, you know, it's not only is forecasting the weather useful, just again, like, what are you going to have an umbrella or wear a coat.

Starting point is 00:06:12 But for things that are more like, you know, there's a dangerous storm coming, flood, extreme heat, extreme cold, those types of things. And also a lot of sort of decision making, like agriculture, energy, transportation. And I think in general, weather forecasting is traditionally been understood to be something that's a very good investment on a dollar. So the public tax money that's invested in the National Weather Bureau's has significant economic returns on those investments. So that's kind of the, I think, my understanding of the history of the kind of standardized or official weather forecasting business in a sense.

Starting point is 00:06:52 Maybe one other thing I can say about the way to think about the industry. I think my understanding is you can kind of think about the weather forecasting industry is divided into, it's almost like a pipeline, really. There's this sort of government official weather bureaus that are issuing these like global forecasts that are predicting all sorts of weather variables, but typically more at like a coarser spatial resolution. And then you have this full, this like big post processing chain where they take the like sort of base forecasts and then they specialize them for different use cases. So, for example, like, when you look at the app on your phone and you see, you know, the chance of precipitation, that's not coming directly from NOAA. That's coming from other intermediaries that are like taking, you know, local weather station data and other historical information trying to kind of like tweak it and improve it and make it especially useful for your use case. And you see that sort of an energy and all sorts of other applications of weather forecasting. Is there a corollary to how that has worked historically to like what's what's happening with LLMs today, not to jump into the AI stuff too early, but just in the sense of like, is the global forecast, let's say NOAA's forecast, is that a big mega model that spits out this one big forecast?

Starting point is 00:08:11 And then what people are doing in the processing world is saying, okay, I'm going to take that model, but then I'm going to like fork it is the wrong word. but I'm going to fork it and add a bunch of additional data into it to try to make it better at a smaller spatial resolution. Like, I'm just trying to picture what it actually is. Yeah. So, I mean, I didn't say much about where your actual weather forecast comes from in terms of, like, technically. So maybe if I say that, then maybe it will sort of open the answer to that question. So, again, fluids, like the atmospheres of fluid. And in physics, we have fluid equations called the Navi-Stokes equations.

Starting point is 00:08:46 and that's they govern fluids like at all scales like from the largest scale structure of the universe which actually turns turns out to also be a fluid down to like what's happening in you know in your blood basically it's there's turbulence that determines sort of how your blood flows and it has kind of important implications now all things in between that right you have like weather and you know stream flow and other types of things like that all fluids and what happens like engineers have figured out that, well, so the fluids are very complicated to simulate. So in order to simulate them accurately, they need to approximate the solutions so they can run them on very large computers. Because they're so complicated, they would never run on a computer natively. You have to

Starting point is 00:09:33 sort of break up the computation and approximate certain things in order to actually, you know, model everything that's happening, for example, in the atmosphere. So that's called numerical weather prediction. The numerical is just saying that they're making a numerical approximation to these Navier-Stokes equations. And traditionally it's been run on supercomputers. So, like, a lot of the big supercomputer centers have either, you know, do a lot of weather forecasting or even built to do weather forecasting. And in many ways, it's been sort of a triumph of science and engineering that we've been able to, like, decade on decade predict not, you know, one day, two days, but like 10, 12, 15 days into the future,

Starting point is 00:10:12 it's hard to even sort of imagine. Like, the scale of that type of predictive accuracy was sort of unimaginable 100 years ago. People just didn't think, like, in two weeks we can kind of know what the weather's going to be. That's crazy. That relies on knowing, like, what's happening on the other side of the earth

Starting point is 00:10:29 as the sort of prevailing winds carry the, you know, moisture and the temperature and all that kind of stuff. And I imagine there's like an exponential increase in, complexity the further out into the future you get, just because, like, there are a variety of possibilities of what actually happens today, and each one of those needs to be taken into account when I'm trying to predict what's going to happen tomorrow and so on and so forth as you move into the future. Yep, that's the butterfly effect, right?

Starting point is 00:10:55 It's that a butterfly might or might not flap its wings, and then that will determine, like, a week later, whether there's a hurricane or not, right? So the idea is that little tiny changes or little tiny effects or lacks, you know, missing effects will cause, could cause huge changes in weather over time. And that's exactly right. So the fluid, the atmosphere of fluid is thought to be chaotic, which means that that's sort of the definition of chaotic. It's that little tiny changes can have huge, large impacts later. That's what makes it so hard. And there's coupling across scales.

Starting point is 00:11:30 Again, the butterfly flapped its wings, but then you have. like up in the top of the atmosphere stuff is happening. Now, that's, that is exactly why it's very, very difficult to sort of, you know, find solutions to the exact equations that govern the atmospheric fluid. It's, so we have to make approximations, and we use supercomputers, and we have all kinds of tricks. I should also say, like, another important thing to recognize is that the, when you generate, when weather forecast has been generated, the, the, actual prediction of the future is only half of the process. It's the second half. The first half

Starting point is 00:12:08 of the process is figuring out what the weather currently is. So if you, we have satellites and we have weather stations and balloons and ships and all sorts of information that are taking measurements of like what the weather is all over the earth. But again, using the butterfly as an example, you would have to know like where every butterfly is in principle to act to perfectly forecast the weather. So if you sort of think that through, you realize that it's not really, weather forecasting is always going to be fundamentally uncertain to some level. We're never going to be able to make perfect observations of the weather everywhere on Earth with the precision required to perfectly predict the weather a week out.

Starting point is 00:12:55 And so when weather forecasting, that's why you have a chance of rain versus like, it's definitely going to rain, right? And you have like a range of temperatures, especially as you go out in time. And that's, again, sort of what makes weather forecasting so hard. So the first step in weather forecasting isn't actually predicting. It's taking all the satellite data and all the stations and all the different observations and estimating the current state of the weather across the earth. And once we have that estimate, then we can make the prediction with the sort of. supercomputer. And that's that second part is what our team and a lot of the teams in the field who are working on AI-based weather forecasting have been especially focused on. But my guess is

Starting point is 00:13:36 that over time we're going to see other parts of the weather forecasting process being, you know, having having more and more AI methods that are coming in and trying to advance them. Before we get into the AI methods, I, it seems like we have generally even pre-AIDS, We've been getting, I mean, you tell me if the curve has been linear or exponential or flat, but like it seems like there's been, I don't know, fairly linear improvement in our weather forecasting ability for decades. Like we're getting more precise. We are also getting better predicting further out into the future, as you said, like, you know, a week, two weeks, et cetera. To the extent that that's true, I'm sure it's all these things. But like, how much of the improvement that we have seen historically has come from? I don't know, A, as you said, just like having better ground truth data on the current state of the weather, B, more compute, as you said, has been running a supercomputer. So we get more and more powerful computers. We can just run more and more complicated and Avastok's equations.

Starting point is 00:14:41 Or C, additional tricks, basically, that allow you to, like, do better predictions without adding more compute. Yeah, that's a great question. So I'll just admit, I don't know the answer to that. I think all three contribute. So I can say on the first one, data, yeah, we like have, there's, you know, better satellites that are flown and there's more, better systems for, you know, collecting balloon observations or these different sort of things. So we definitely are getting better data, and we know that that improves the quality of the forecast. We're also getting better models. That's definitely true as well.

Starting point is 00:15:16 We're building bigger supercomputers. They can operate at finer resolution. just, I think, in the last less than 10 years, the ECMWF, which has the best weather forecast, they increased the resolution, meaning that they had finer detail and space in their forecasts, and that allowed the forecast to be more accurate. So you see both like adding just raw compute power, but also improving the quality of the models and the approximations can also, you know, has also made, I think, a pretty dramatic impact. And I think that sort of blurs into your third category of, like, other tricks.

Starting point is 00:15:53 I think in general, you have, you know, without getting into the details of how the numerical models work, you can kind of think about them as a backbone that's making a sort of general prediction at a course scale, and then you have a lot of other parameterizations and trickery sort of under the hood and making finer and finer grain predictions and also updating the backbone to be consistent with the fine grain. and those are all being advanced sort of in parallel, and like the teams, these engineering teams and scientific teams are sort of working together to make these better.

Starting point is 00:16:29 The last thing I would also say, too, is again, going back to that post-processing part of the weather forecasting pipeline, it's not just, again, that like these large, you know, NOAA and ECMWF and these other large agencies, it's not just that they are improving the forecasts, it's that other parts downstream

Starting point is 00:16:45 and post-processing are improving what they're doing. So actually the first advent of like AI and machine learning in weather forecasting, or at least some of the earliest, was not like trying to overhaul the whole weather forecast process itself, but making, you know, using more and more like statistical methods and linear regression and nonlinear regression, neural networks and other types of earlier machine learning techniques to improve, not the base forecast, but like the specific application. So maybe we can calibrate your, you know, chance of rain better if we have a slightly better downstream model.

Starting point is 00:17:19 Virtual power plants are becoming a reliable way for utilities to manage capacity, but enrolling devices is just the start. What really matters is confidence, knowing those resources will perform when dispatched, and being able to prove it, from the control room to the living room. Energy Hub's platform handles the full picture, from near-real-time forecasting, locational dispatch, and the kind of rigorous verification that holds up when regulators, grid operators, or leadership ask, did it deliver? easy enrollment creates momentum, proven performance builds trust.

Starting point is 00:17:53 That's why more than 170 utilities rely on Energy Hub to manage over 2.5 million devices delivering 3.4 gigawatts of flexible capacity. See what that looks like at energyhub.com. We're living through a profound economic shift, and energy sits at the center of all of it. Trillions of dollars are flowing into power plants, transmission lines, battery factories, data centers, but the future of energy isn't shaped by technology alone. It's shaped by markets, by policy, by capital, and by the institutions that connect them. I'm Alfred Johnson, CEO of Crux, the capital platform for the clean economy. Join me for my brand new show, Critical Capital, as I talk with people deploying capital, shaping policy and building projects. Together,

Starting point is 00:18:39 we unpack how risk is priced, how incentives are structured, and how progress is actually made. Listen to Critical Capital on Spotify, Apple, or wherever you get your podcasts. Are you tired of overpaying for big-name PR firms, but not really knowing what they're delivering? Is your comms team wasting time reviewing lengthy messaging briefs and decks, instead of engaging journalists or producing content? Are you wondering why your competitors are getting press and you aren't? Fish Tank PR is an award-winning climate and energy tech, renewables, and sustainability-focused PR firm dedicated to elevating the work of both early stage and established companies. Whether you need to position yourself as a thought leader in between project announcements or translate complex ideas and technologies into tangible, compelling stories that resonate with the media, F-Tankpr.com. Check out fish tankpr.com. That's F-I-S-C-H-Fish-Tankpr.com.

Starting point is 00:19:34 So, okay, so we've been improving. We've been, you know, in more recent years, applying sort of the earlier versions of ML to continually improve. I guess I'm curious from your perspective. what the biggest gaps are. I mean, like, obviously, we don't have the ability today to generate a perfect forecast three months into the future. Like, it could always get better. But apart from just that element of it, are there any areas where you feel like, actually, there's like a real, it's really hard to do X? Is it like precipitation is a bugaboo or, you know, something else, right?

Starting point is 00:20:07 Yeah. I mean, I think there's sort of two ways to answer that. It's, you're always going to be limited by the quality of your data. So if you don't have good data about something, it's going to, you know, you're just bad, you know, garbage in garbage out sort of thing, right? So these models take an estimate of the state of the current weather and then predict what's going to happen. If your estimate isn't very good because your raw observations weren't very good, you're not going to get a very good forecast. So improving, just collecting more data and using the data you have collected to form a better estimate of the current weather, that's definitely going to always improve things. that's it's sort of a known gap right now we don't know exactly what the ceiling is we don't know like

Starting point is 00:20:48 if we've you know if we do this satellite or that station observation how is it going to improve things we might have an idea but we don't always know and sometimes we have to just test it out but the um yeah the the other thing i would say is that you you have different features of weather which are harder or easier to predict so an obvious one is temperature so temperature is sort of very smoothly. Like if you look at a map of the temperature across the earth, it's sort of, you know, it's not, you know, up a mountain. It's going to be colder and like in a valley it'll be, you know, different. But it's sort of very smoothly. What doesn't very smoothly is precipitation. So like a rap, you know, a violent thunderstorm that sort of emerges out of nowhere

Starting point is 00:21:31 and there's like high wind and low pressure and all this kind of stuff. That, like, the, like, where exactly that front will be, like where exactly the precipitation will happen, what exactly the wind and these kinds of things are much, much harder because the detail of, like, everything's happening at a finer scale. Like where, you know, even if you look at a radar map, you can see this. It's not like precipitation sort of varies smoothly over the earth. You see like there's a little, you know, thunderstorm right there or like there's a, you know, rain and it's, and then, you know, a few miles over, nothing. So that type of very high resolution, complex, you know, patterns of precipitation, for example, wind as well, those are much harder to predict because you're effectively predicting a lot more

Starting point is 00:22:16 information. You can't just sort of summarize it by saying, oh, every 25 kilometers or whatever, the temperature is this, and then everything else is just kind of interpolated in between. You have a lot of stuff happening at a much finer scale, finer scale than a lot of our models even capture, and then we have to do a secondary step to try to resolve those finer details. All right, so let's talk about AI then. I mean, you mentioned this is one thing, right? Whenever we talk about AI, quote fingers applied here, right? Like, there is ML as a subset of AI, a related entity.

Starting point is 00:22:50 We've been doing ML already. So I guess the first question that I have is, as you think about leveraging AI now and into the future for weather forecasting, what version of AI are we talking about? Like, what version or versions are you actually? What are the actual capabilities and or model structures that are interesting here? Yeah, that's a good question. So, yeah, these days, I mean, AI is a pretty catch-all term. I find myself just using the word AI just to mean a lot of different things because I think it's kind of easier.

Starting point is 00:23:27 Right. Usually people kind of know. The difference, but the way I would say it is the difference between AI and so machine learning is sort of the like statistical inner core of A. AI. It's trying to capture, taking data and trying to capture the patterns through a training process and then, you know, kind of use some inductive assumption like, what we've seen in the past is going to be similar to what we see in the future. Modern AI, I think, is a broader family of things. It sort of involves, like, you know, agents and your interactions with them, and a lot of, like, language models are often, you know, sort of associated with AI.

Starting point is 00:24:05 what we use in our weather forecasting models and a lot of folks out in the community are using as this field is advancing and this AI-based weather forecasting is developing. We're still mostly using fairly traditional machine learning, supervised learning. So supervised learning just means you take a data set that has a pair of examples, an input example and a target example, and you train a model to try to take into. input examples and accurately predict the target examples. And so if you think about weather as, you know, again, like I said before, you're estimating the current state of the weather and then the next step is to predict what's going to happen next, right? That's, it can be a supervised learning method can be trained to do that, and that's what

Starting point is 00:24:53 we're doing in our models and like most folks that I see are doing as well. And then the only extra step is that just makes one prediction, but then we feed the output of the model back into itself. and then we have it make another prediction. So the output becomes the input, and then it generates another output. And if you just sort of chain those steps together, you get your first input, and then you get a sequence of outputs that represent future steps in time. And so supervised learning, we use a lot of these days in terms of AI architectures,

Starting point is 00:25:24 transformers and graph neural networks are what we use. People use convolutional neural networks. But I don't feel that neural network architectures these days tend to be the sort of exciting part. it's usually more of like the training and the sort of data, how you handle the data and that kind of thing. So, but you mentioned Transformers, because I guess, like, if we had been having this conversation five years ago, right, I imagine that you still would have told me about supervised learning, for example, right?

Starting point is 00:25:53 Like, that wasn't, that's not new. Transformers had been invented by that point, but, like, had not been, you know, broadly applied the way that they are today. So what is it that like this new wave of AI unlocked by things like by transformers and convolutional neural networks and so on? Like what does that enable above and beyond what you would have been able to do five years ago? Yeah, that's a good question. So the way I look at it is so transformers are very similar to graph neural networks. They both of them are.

Starting point is 00:26:27 So actually, let's let's take this back. So we used to use often convolutional neural networks. And the idea here is it learns a little function that's sort of local in an image. And then it sort of applies that same function everywhere. And then you stack up sequences of these layers. And that eventually lets you like one, the information on one side of an image communicate with the information on the other side of the image because it's a hierarchy. A transformer architecture allows you to make a direct connection between the information on one side of the image and the other side of the image.

Starting point is 00:27:00 the same way that graph neural network does. And I like to think about it like graph neural networks because what it's like saying is, well, in a graph you have nodes and you have edges or connections between the nodes. And a longer connection between nodes is for nodes that are farther away and shorter connections are for nodes that are closer.

Starting point is 00:27:19 So if you use the graph neural network analogy to describe the older convolutional networks, it's like the graphs are all small. They're all kind of, everything's kind of close. It's like nearby in an image. Graphnal networks allow you to choose how far a way you want information to interact. And in Transformers, it can be understood as a graph that has connectivity across any spatial scale. So in language models, the way a transformer works is it says, when I want to make predictions

Starting point is 00:27:47 about the next word, I want to be informed by the most recent word, but also every word that has happened in the text before. And that's important because in language, the next word is. not predicted by just the previous word. It actually is predicted by stuff that happened earlier in the sentence or in the paragraph or in the, you know, book. And so the ability to make, to have information across large spatial scales that interact with one another, that ability allows you to, it opens sort of new patterns of computation and allows you to represent functions that have traditionally been harder to represent, but it allows you to make, you know, better predictions of the next word or the next, you know, in our case, you can kind of think about it as the next spatial point or a faraway spatial point.

Starting point is 00:28:34 And that allows the model to be more flexible and capture richer functions. There's a good comparison there. What do you think of as being, I guess you just described something that is similar about what you can do in weather forecasting, thanks to a transformer architecture, to what you can do with large language models, which is what most people are going to be most familiar with in the, in the new wave of AI. What's different? So that's a great question. I think what's interesting is that the way that, so in language, the text is understood to be, is treated as a sequence. It's like, you know, token, token, token. We are also modeling sequences in weather, but we're not allowing our models to look too far back in time. So because weather is actually different from text in a fundamental way, in fact, most physical processes are, they are what's called Markoff in that the most recent state of the system determines the subsequent state.

Starting point is 00:29:38 So like I said in text, that's not the case. Right now, I'll just pause. You didn't know what word I was going to say next, right? It kind of depends on the context, a bunch of words behind it or earlier. With weather forecasting in principle, if you know exactly what's happening right now, you can fully predict what's going to happen next. You don't need to look further back in the past. So we actually use transformers not to model the interactions in weather over time,

Starting point is 00:30:06 like the sequence of text, but in space. So in text, you actually don't have a sense of spatial structure, right? You just have one sequence of text. It's just word, word, word, word. When you read, you just see word, word, word, word. In weather, you have spatial structure, you have weather all over the Earth at the same time. And it's all, you know, especially the close weather, it sort of determines and can be used to predict what's happening next at, you know, our current location. And so we use transformers and graphenone networks to capture the short and long range spatial dependencies.

Starting point is 00:30:44 And those interactions between, you know, what's nearby and what's about to happen now. next are what determined weather, and that's how we sort of make these predictions. But one thing I should also add is that similar to how I was saying earlier that it's kind of impossible to measure everything that's happening on the earth and the fine detail in weather. You have to make approximations. These models do too, and this actually brings us to a very fundamental difference between how AI models are making their predictions and how traditional models are. So AI models can take the statistical structure of weather patterns.

Starting point is 00:31:22 So, for example, if I'm looking at a hurricane that's traveling over the earth, right? In a traditional model, the way it simulates that is in a very fine detail, it kind of figures out like, what's the pressure and the temperature and the wind and the moisture and what are those things? What's going to happen next is determined strictly locally. AI models, because they can look at a much larger spatial range, they can use what traditional methods use or they can use other approaches. Because when you look at a hurricane, it almost looks like an object sliding over a globe. That's not how a traditional model models it. And we don't really understand how the AI models forecast it, but they are capable of treating

Starting point is 00:32:06 the hurricane as almost like a large macroscopic scale object that is moving, because they can see all the structure of the hurricane and they can see, you know, sort of what's happening in the recent past. They have like spatial awareness in a way that the old models didn't. Yeah. And we don't know. That's, it's a really interesting area, I would say, of like, sort of the science of how AI works to understand exactly how they see the world in that sense.

Starting point is 00:32:32 It seems like to me, on one hand, a much harder problem than an LLM because you've got the entire physical world and the data is sparse. as you said, and there's lots of complex interactions. On the other hand, it's determinative in a way that LMs are not, right? Like, there is no correct next word necessarily, right? There's like a, it's a best guess as to what the best next word should be, but in the case of weather forecasting, there is a correct prediction to make. And there is a universe of historical data that you can draw upon to do it. So I'm back and forth on whether this is a harder problem or an easier problem than like making a really, really good LLM.

Starting point is 00:33:14 I think, yeah, I think you'd have, I think depending on, you know, what, what, where your allegiance is live, that'll probably be, I think it's probably more of like an opinion question. But yeah, I think you're absolutely right. So the way I think, so it's, the one thing I just would, like, I might say it a different way is that it's still, like, like I said before, it still is fundamentally uncertain from the standpoint of the information that's available to the model. Now, yes, like the physics is truly deterministic underneath, but because the model doesn't, again, see the butterflies or see the little fine scale stuff, from its perspective, it actually is a random process, right? Because if it doesn't know whether the butterfly flapped its wings or not, how could it

Starting point is 00:33:58 know whether the hurricane is going to form or not, right? Now, so from the perspective of the information available to the model, it is also an uncertain random process to some extent. But I, yeah, I think what you're saying is exactly right. So I think the underlying structure of text is random in different ways. It's like, again, I can, I'm going to pause and then I'm going to say a word, furniture. Right. Like, weather doesn't work like that, right? Like, it doesn't, it doesn't just have something pop into its head that's completely different than what's historic, like, present in the historical record, like you pointed out. So I think that the structure of the uncertainty is different in weather. It's, it's more constrained in a way.

Starting point is 00:34:40 So in text, it's, you can imagine, you know, if you're watching, same thing with like video. You're watching a movie and someone's going to come through the door. You have no idea what they're going to be wearing as a shirt, right? It could be wearing blue, red, anything. There's like no way to predict it. And whether you can always have some idea, you just don't know the fine details. On the flip side, weather is an extremely complicated process. This fluid, chaotic fluid system has, you know, interactions from small scales to large scales, and it's happening.

Starting point is 00:35:10 all over the earth at once. So in some sense, instead of just predicting the next word, you're predicting, like, millions of variables at once. So I think you can kind of like, you'd probably better, like, to have this as like a debate over beers with your friends in their LLM lab rather than like something that can be adjudicated just on the basis of these things.

Starting point is 00:35:29 I guess there's one other question on this sort of comparison to LLM world. Notoriously, like the big LLMs are trained on the internet, right? Your training data set is like all words on the internet. And so that's one of the reasons that they've been among the first sort of major AI models in this new wave to commercialize is because there is this gigantic body of training data that you can draw upon. Now we're hearing lots of folks who are in like robotics world, for example, like facing the challenge of there just isn't an equivalent data set. You can like try to train on YouTube videos or whatever, but it's not quite the same thing. In the case of weather forecasting, it seems to me like in theory you have an incredible historical, you can look at,

Starting point is 00:36:09 every historical weather measurement, right? If you had access to that data, if Noah, Noah does, right? Like, every input data point they ever took historically, and then the subsequent next measurement, which dictated what happened after that, I would think that would be an incredibly rich training data set. Am I, well, two questions, am I right about that?

Starting point is 00:36:34 And is that actually available? Yeah, that's a good question. So the first thing I would say is actually it's not just language, right? Like the first big visual neural networks were built, where it came out of after ImageNet, a big corpus of image data. The first language models, like even a decade ago, they were starting to build large text databases. Protein folding, big databases of protein.

Starting point is 00:36:57 So actually you see like, you know, AI and machine learning are still very, very sensitive to the availability of high quality data and large amounts of it. And you kind of like, you know, one of the best ways to advance the field is to go collect high quality data and make it sort of standardized and available. So for weather, I think that there's sort of good things, there's good news and bad news. So for one thing is we were very fortunate when we started. I think all the folks who were working on AI weather forecasting have benefited tremendously from work that was done by the ECMWF, the European Center for Meteorange Forecasting. they built this data set called Era 5.

Starting point is 00:37:37 And they've been building these, you know, era 5, I think was the fifth generation of the era data set. It was a record of Earth's weather going back for decades. And I think it was originally released going back into the 1979, and then they actually opened it to like back into the 60s. And they weren't, they didn't design this data set to be supporting machine learning. I think it was more to just, you know, have an authoritative record of the climate on Earth over, you know, year on year. and it's at like a six-hour resolution

Starting point is 00:38:04 and 25-kilometer spatial resolution, so it's very, very rich. It just happened to be perfect for machine learning for weather, and it was just a really well-curated data set. The folks at ECMWF are just brilliant and sort of organized and systematic, and they had made this available,

Starting point is 00:38:21 and it allowed a lot of people to sort of, you know, stand on the shoulders and build, you know, great new AI methods. Now, one thing is, though, because we had different satellites and different stations over time, It's not actually all, the data set is standardized, but it's not derived from the same underlying observation. So the quality going back in time actually gets worse in terms of it's not as accurate of a record of weather just because, again, as I said before, the input data wasn't as good. Right.

Starting point is 00:38:50 Now, the other thing that's not great about weather is that weather takes a while to happen. So, like, we have to sort of just wait for more weather data to happen to get more data, right? Like, the weather data we have so far, we're sort of stuck with it. Now, like, tomorrow we're going to have one more day of weather data. But, like, when our models are taking six-hour steps, we have to just kind of wait. So you're sort of like, we got a lot of data, but at the same time, there's not much room to get more of the same kind of data. And I think we and a lot of others are now looking to more unusual or under-explored sources of data to support building richer, better models. Right.

Starting point is 00:39:34 Like, is there like a distributed network data? Like, in theory, if you had access to, I don't know, everybody's cell phone, everybody's iPhone, there's probably a bunch of sensors in the iPhone. Like, presumably you could pull something from that that would have some signal for you. Yeah, yeah. You can geek out on all these things. Like the way, my favorite one is like, you know, I have a video doorbell. It sits there and watches weather all day. Right, right.

Starting point is 00:40:01 Like cars, your car, right? Like, it's when you're, you got the, you know, it's got a thermometer in it. It's got like your windshield wipers. Some of them now are, you know, they're rain sensing. So it's sensing rain. Or it's like the lights go on automatically when it's dark. So like they're sensing whether it's cloudy or these kinds of things. So like I get very excited about the possibility of using all these kinds of things.

Starting point is 00:40:21 possibility of using all these kinds of things. The other one that's even weirder is, like, you know, people go on and they tweet about the weather or they, like, talk about the weather on, you know, social media. And, like, those types of observation, those are still observations. Well, you don't know if they're very good. But there's actually a very wide range of pretty unusual under-exported data sources. But even before we get there, like, I think you can start to think about there's a lot of companies that are trying to build very cheap weather stations. People can put them on their roof. Like, these kinds of things could really help both with the kind of core weather forecast and probably a lot of the applications that people want to use weather for.

Starting point is 00:40:56 So my takeaway from that is that despite what I said, there being this like amazing historical record of every weather measurement that's ever been taken, you still feel kind of data poor, right? Or like training data poor, I guess. You're always data poor, right? Like, that's sort of the story of modern AI is you're basically always kind of data poor. Because one of the most incredible facts about modern AI is just how well it scales with data. More data just means better models. And, you know, I was a person who was very skeptical of this. I didn't think that it was going to scale like this. And I would make my sort of logical arguments. But it turns out I was wrong. And I think a lot of people were wrong. And the folks who really understood that data could

Starting point is 00:41:34 really add value, even at extremely large scales, were right and pursued that course and brought us to where we are today. Interesting. Okay. So I guess I want to finish by talking about what might come. Like, if you, if you draw a line forward a few years into the future, I know, you pick, pick your time, three years, five years, ten years, whatever it is, and you and everybody else who's working on AI weather forecasting succeed, where might we be? Like, what might be possible in a few years that's not possible today? Yeah, I mean, I think that, I think there's a lot, you know, weather affects everything. And it's, you know, it has, you know, different things, energy is obviously a very, very sensitive to weather.

Starting point is 00:42:19 Some things are, you know, only kind of loosely affected by weather. So one thing I would like to see, and I think it's very exciting, is a wider range of use cases of weather. So, for example, like we know that even people make different choices about, you know, what to put in their refrigerator or, like, you know, what clothes or whatever. these different choices, they're going to go on a trip what they expect the weather to be, I think that you can start to make more subtle and informed kind of guidance and suggestions for people on the basis of more accurate weather forecasts.

Starting point is 00:42:59 And that's kind of like at the consumer level, but I also think that at the kind of industry level, there could be a huge opportunity. So, for example, in energy, you know, we see there's, you know, if you have a wind farm or a solar farm, you're making forecasts about the weather, and then you're kind of using that to figure out, like, how you know, if you're going to have energy to sell and how you get price it. But I have a feeling that there's a lot of headroom,

Starting point is 00:43:22 a lot more to be gained in how we, you know, plan out our, you know, how to operate our electrical grids, how we predict what the electrical demand is going to be. Is it going to be hot? Is it going to be cold? Is it going to be humid or the air is carrying more, you know, mass, which, you know, requires more energy. to heat, cool. I think that we just haven't really scratched the surface of the opportunities.

Starting point is 00:43:47 I think that supply chains and logistics and even just like lots of choices that, you know, driving and these type of things, I think, really could be better informed by better weather forecasts. And I don't think we've even begun to kind of get into this. And we need to see the quality of the forecasts get better and more customized to these use cases to start unlocking that. The other one obviously is just better crisis handling crises. Like some of our recent work on tropical cyclone forecasting, you know, I'm very proud of. And we think hopefully with better forecasts we can warn people earlier, more accurately. We can, in some cases where there's a disaster, like a wildfire, maybe we can even go and intervene and try to stop it before it gets completely out of control.

Starting point is 00:44:34 So, again, this is not something that we can do today, but I'm very optimistic that, and in general, maybe this is coming through, but I just, we, you know, my team and I really believe in the power of technology that can have a lot of positive benefits. So we often are trying to look for ways that we can, you know, put technology to best use and sort of, you know, leave our kids with a world that's better than the one that, you know, we grew up in. All right, Peter, this was super interesting and useful for me both in the context of thinking about, like, weather forecasting and just like understanding, like, the where, you know, where. AI, how AI is getting applied in various industries, what the challenges are, what the opportunities are. So really appreciate your time. Sure. Yeah, it was great to have, to be here. And your questions were great, too, by the way. It was really fun. Peter Pataglia is the Senior Director of Research at Google Deep Minds Sustainability Program. This show is a production of Latitude Media. You can head over to Latitudemedia.com for links

Starting point is 00:45:28 to today's topics. Latitude is supported by Prelude Ventures. This episode was produced by Daniel Waldorf. Mixing and theme song by Sean Marquan. Stephen Lacey is our executive editor. I'm Shayal Khan, and this is Catalyst.

Catalyst with Shayle Kann - How AI is changing weather forecasting

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.