The AI Daily Brief: Artificial Intelligence News and Analysis - Amazon Joins Google and Microsoft in AI Nuclear Buildout

Episode Date: October 18, 2024

Amazon joins Google and Microsoft in a major push toward nuclear energy to meet AI’s growing power needs, committing to 600 megawatts in nuclear projects. The announcement highlights how big tech is... relying on small modular reactors to meet climate goals and sustain the energy demands of AI advancements. Concerned about being spied on? Tired of censored responses? AI Daily Brief listeners receive a 20% discount on Venice Pro. Visit ⁠⁠⁠https://venice.ai/nlw⁠⁠⁠ and enter the discount code NLWDAILYBRIEF. The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Daily Brief, Amazon joins the nuclear party, while NVIDIA continues its move to other parts of the AI stack. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. I was at an AI dinner last night featuring investors, some big enterprises and a number of different startup entrepreneurs. And one of the big points of discussion was just how dominant in Vindia really is. Now, part of the conversation is, of course, how they are the company that has made the most money out of the whole AI revolution so far, but another part of this was how they are quietly moving across the entire stack. In other words, they are not content to just be the
Starting point is 00:00:46 chips underneath everything they are working on basically all the other pieces of AI as well. Lending credence to that conversation, Nvidia quietly released a new open source model that seems to be blowing away the benchmarks. Called the Lama 3.1 Nemotron 70B instruct, the model was uploaded to Hugging Face without fanfare on Tuesday. The model apparently performs better across major benchmarks than OpenAI's GPT40 and Anthropics Claude 3.5 Sonnet. As you can probably tell from the name, the model is a refinement of META's open source Lama 3.1 model. The training featured RLHF for reinforcement learning from human feedback, and overall Venturebeat writes, Nvidia's latest model release signals just how fast the AI landscape is shifting. While the long-term
Starting point is 00:01:24 impact of Lama 3.1 Nemotron 70B instruct remains uncertain, its release marks a clear inflection point in the competition to build the most advanced AI systems. These recent releases, particularly the open source NVLM project, have shown that Nvidia's AI ambitions go beyond just competing. They are challenging the dominance of proprietary systems like GPT40 in areas ranging from image interpretation to solving complex problems. Now, as the model has only been out for a couple of days, people are still in the early stages of figuring out what it can do. Still, as Venturepeat points out, the model correctly answered the question how many R's are in strawberry with a detailed and accurate response, which is a classic challenge for LLMs.
Starting point is 00:02:01 Now, not everyone is convinced. X user Centex reran the benchmarks and commented that it wasn't quite as good as claimed, but still, this brings up really interesting questions about the commoditization of models. Will companies and individuals always be willing to pay for the state of the art, or will we reach a point where cheap but good enough models are frankly good enough? Speaking of competition in areas other than the state of the art, Mistral has released a new set of models that are optimized for laptops and phones. They are calling it Le Ministrao and released two models 3B and 8B, both of which have a context
Starting point is 00:02:32 window of 128,000 tokens, and which they write were, quote, built to provide a compute efficient and low latency solution for local privacy first inference for critical applications such as on-device translation, internetless smart assistance, local analytics, and autonomous robotics. Ultimately, there's nothing huge here other than a reinforcement of the larger trend lined, which is that even as competition at the state of the art increases, so too does competition for these smaller, cheaper models as well. Moving over into the world of energy,
Starting point is 00:03:01 which, by the way, will also be the subject of our main episode, little-known data center startup Crusoe Energy has formed a $3.4 billion joint venture with Blue Owl Capital to build a 200-mawatt data center in Abilene, Texas. Once complete, the facility will house one of the largest supercomputers in the world. The press release was a little coy, stating the project is 100-per-long-term, leased to a Fortune 100-Hiperscale tenant,
Starting point is 00:03:22 with occupancy expected to begin in the first half of 2025. The design will be optimized for direct-to-chip liquid cooling and will also accommodate air cooling. At completion, the data center will be able to operate up to 100,000 GPUs on a single integrated network fabric, quote, advancing the frontier of data center design and scale for AI training and inference workloads. That description could only fit the new data center being built by Oracle for use by OpenAI, which is exactly what the information reported. For some, the curious part of the story is why a project so big would choose a relatively obscure company to design and build it. Crusoe Energy began as a specialist in building Bitcoin mining facilities.
Starting point is 00:03:54 There are a number of big differences, but one of the things Crusoe became known for was cutting-edge cooling systems. It sounds as though this new AI cluster will leverage a number of different experimental features, so perhaps Crusoe is the only company with the technical chops to pull it off. It also sounds as though the project will be highly unconventional in a number of other ways. The deadline is extremely tight with OpenAI wanting to fire up the GPUs by early next year. Crusoe described this as a, quote, record-setting construction timeline. The press release added, The notional power plan for the site includes both on-and-offsite renewable resource,
Starting point is 00:04:24 including surrounding wind developments and a potential future large-scale on-site solar installation. The goal is to optimize existing renewable power resources and incentivize new greenfield renewable power development. It sounds then as though the facility will tap into local renewable energy projects that have been constructed but are not yet connected to the grid. This was another specialty of Crusoe in the Bitcoin mining space. The firm had its origins in deploying shipping containers filled with miners adjacent to oil wells, capturing methane waste to power their generators. For all the discussion in the Web 3 world of the intersection of AI and Web 3, so far it seems like the biggest overlap is taking advantage of the data center
Starting point is 00:05:00 expertise of the mining adjacent companies to take on projects that no one else would be crazy enough to take on. One more today, Photonic Computing Startup Light Matter has raised $400 million to solve one of the biggest bottlenecks in next generation data centers. Their product is a high bandwidth switch that uses light waves rather than wires to pass communications between GPUs in a training cluster. Theoretically, this could be a much faster way to connect the units together. Currently, the top-end data centers use 100,000 GPUs, but they all need to be connected together, allowing them to function somewhat like a single machine. This scale is already banging up against the limits of current technology and performance issues
Starting point is 00:05:34 are going to start to hinder further advances. Light Matter CEO Nick Harris said, Hyper-scalers know that if they want a computer with a million nodes, they can't go do it with Cisco traditional switches. Once you leave the rack, you go from high-density interconnect to basically a cup on a string. Now, the current state of the art is NVIDIA's NVL-72 platform, which wires together 72 blackwall units in a single rack. The issue comes from the connections between racks. Harris added, for a million GPUs, you need multiple layers of switches and that adds a huge latency
Starting point is 00:06:00 burden. You have to go from electrical to optical to optical. The amount of power you use and the amount of time you wait is huge, and it gets dramatically worse and bigger clusters. Harris is convinced his company can deliver, and it seems like that conviction is hard one. He said finally, Photonics is coming way faster than people thought. People have been struggling to get it working for years, but we're there. After seven years of absolutely murderous grind. That's going to do it for today's AI Daily Brief Headlines edition. Next up, the main episode. Today's episode is brought to you by Plum. Generative AI promises to supercharge your productivity and give you superpowers, but if you're not an engineer, trying to harness AI
Starting point is 00:06:38 can be incredibly frustrating. Hours wasted wrestling with complex tools only to give up when they don't work. We all have tedious tasks we'd love to automate and challenges AI could solve, but few of us have the skills to fully leverage these game-changing technologies. That's where Plum comes in. The mission? To make automating your work feel like magic. Imagine typing out AI, read my Gmail and ping me in Slack when something critical comes in, and watching it come to life before your eyes.
Starting point is 00:07:01 No coding required. Whether you're a marketer, salesperson, or founder, Plum enables you to create custom AI workflows in minutes, not hours. Check out UsePlum.com, that's Plum with a B, for early access to the future of Workflow Automation. Today's episode is brought to you by Venice. AI companies, store your entire conversation history and attach it to your identity forever. That's every question you ask, every answer you receive, every image you generate, every thought
Starting point is 00:07:26 you share with the machine it's all being spied on. If you trust all the companies, hackers, and NSA board members that will ever have access to your AI conversations, then rejoice, for you are well served. For the rest of us, Venice is an alternative. Venice is a powerful AI app for text, image, and code generation that respects you as a sovereign individual, and believes privacy and free speech are not only human rights, but necessary for civilizational advancement. Private, permissionless, and uncensored, you can try it for free without an account. AIA Daily Brief listeners receive a 20% discount on Venice Pro. Visit venice.a. slash NLW and enter the discount code, NLW Daily Brief. That's NLW Daily Brief. All one word.
Starting point is 00:08:04 Today's episode is brought to you by Super Intelligent, which is of course our platform that helps you learn how to use AI tools and perhaps even more importantly, gives you ideas on the best use cases that are actually going to help you achieve whatever it is you want to achieve. To recognize the end of summer and back to school slash back to work, we are running our best promotion ever. When you sign up for Super Intelligent, using code so back, your first month will be 100% free. The platform features over 600 fun, highly practical AI tutorials that get you using AI fast and with an eye to actually transforming how you get things done. We've just launched super for teams. So if you have a group of people at your
Starting point is 00:08:46 company that want to figure out how to use AI together, I highly suggest you check it out. But for those of you who are using Superintelligent as an individual, once again, if you sign up for Superintelligent between now and the end of the month using code so back, you will get your first month 100% free. Go to be super.aI and check it out today. Today we are talking about something that at this point is absolutely undeniably a trend, and that is, of course, AI going nuclear. Now, one of the things that is very clear in has been clear for a while is that one of the bottlenecks and constraints on the continued buildout of AI was ultimately going to be power, especially as we push further and further into
Starting point is 00:09:26 the frontier, the need for energy for compute for data centers just goes up and up and up. Now, nuclear has long been discussed as a potential answer to that challenge, and it's finally starting to happen in a real way. In September, we got word that Microsoft was making a deal to help restart Three Mile Island, and earlier this week, Google announced a deal with Cairo's to purchase nuclear energy for multiple small modular reactors. Amazon has now joined Microsoft and Google in making a big commitment to nuclear power with the company announcing three deals on Wednesday, each focused on the construction of small modular reactors.
Starting point is 00:10:01 Two of the deals are partnerships with local energy companies in the Pacific Northwest and Virginia to build out new generation, while the third is a $500 million investment in nuclear energy startup X Energy, who are developing pebble bed SMRs. The Pacific Northwest project will serve as the test bed for this new technology. All in, Amazon is looking to partner in the development of around 600 megawatts of nuclear power across the two sites, with room for additional expansion over time. Amazon said that they expect the reactors to be up and running in the early 2030s,
Starting point is 00:10:29 and overall, X energy is seeking to bring 5 gigawatts of power online by 2039. Like Google's nuclear announcement from earlier this week, Amazon presented the news as a climate initiative. Matt Garman, CEO of Amazon Web Services said, Nuclear is a safe source of carbon-free energy that can help power out operations and meet the growing demands of our customers while helping us progress towards our climate pledge commitments to be net zero carbon across all our operations by 2040. One of the fastest ways to address climate change is by transitioning our society to carbon-free energy sources, and nuclear energy is both carbon-free and able to scale, which is why it's an important area of investment
Starting point is 00:11:03 for Amazon. Our agreements will encourage the construction of new nuclear technology that will generate energy for decades to come. Now, what's starting to become really clear is that the big tech firms believe that the only way that they can meet both their emissions targets as well as their need for increased consumption is the full embrace of nuclear power. These recent announcements have completely cast off any hesitancy around nuclear that was common even up to just a few years ago. Now, it's a little bit out of scope for this particular show. The Cliff Notes version of SMRs is that the idea is to use new designs that have less risk of meltdown. Amazon and Google are each choosing a competing design. The other big point in addition to risk is that SMRs should be
Starting point is 00:11:39 quicker and cheaper to build out than conventional nuclear power plants. But another interesting to mention of this is how big tech going full force into nuclear power might change the way that the U.S. thinks about infrastructure investment in the green transition. Nuclear is currently a little under 20% of the U.S. power supply. The last new reactor was built in Tennessee in 2016. That construction commenced all the way back in 1973 and was put on hold for 32 years until 2007. That means that even once work was restarted, it still took almost a decade to bring the reactor online. Most U.S. nuclear power plants were constructed between 1970 and 1990, and typically have a 50-year lifespan. That means that even if electricity demand remains steady, there would still
Starting point is 00:12:18 be a need to refurbish or replace the entire fleet by 2040. There's been a recognized need to move on this issue for some time, but until recently there hasn't been any urgency. Electricity demand has basically been stagnant in the U.S. for decades, and tight regulations have made the development of new reactors a challenging proposition. This goes double for new reactor designs. A new design hasn't been approved by U.S. regulators in over 50 years until recently. Contrast the situation to other nations. China has built 55 nuclear reactors over the past 10 years, bringing on more than 34 gigawatts of new power. They have another 23 under construction. India has plans to build 18 plants over the next 10 years. Basically, for some time now, there has been a nuclear renaissance happening around the rest of the
Starting point is 00:12:57 world, and it appears that AI is bringing it to the U.S. Perhaps the key to making this happen is stable demand. The hyperscalers are signing these nuclear deals committing to be anchor customers. That is, they are committing to purchase enough energy to make the new reactors viable all in their own. This, in many ways, was the missing piece for U.S.-based nuclear. New reactors are extraordinarily costly, so without guaranteed stable demand in place, they can't be built. Now, another part of the story is, of course, the larger shift in narrative. Mark Nelson on X writes, Sierra Club last month quietly reversed 50 years of anti-nuclear advocacy to endorse nuclear energy as a clean source of power.
Starting point is 00:13:31 In their new electricity policy report, they call for using nuclear to protect the climate. Now, I think one could argue that this is actually more like a tacit acknowledgement that it's necessary rather than an overt reversal of their position, but still it shows just how inevitable this shift is. Rudy Havinstein on Twitter also points out how this is a broader shift as well. He writes, it's odd how for decades anyone pro-nuclear was some sort of anti-baby seal far-right maniac, but once Microsoft and Amazon needed, it's okay. Many in the AI community are just focused on the fact that this is yet another indication of the hyperscaler's belief in what the future looks like.
Starting point is 00:14:04 Bindu Ready writes, all the cloud vendors anticipate a massive surge in AI inference and are ensuring they will have energy in the future. Key takeaway, AI training and inference will grow exponentially over the next decade, and it will require a lot of power. Ethan Malik points out, the Google deal to acquire small nuclear reactors to power data centers increases the odds that we will see AI model scale through at least three more generations in orders of magnitude post-GPT5 to 2030.
Starting point is 00:14:27 According to Epic, power was the binding constraint. Lori Mila Verta, however, points out something important about this discussion. She writes, The story on electricity demand growth from AI and data centers is ridiculous, overblown. Excellent graph from the new IEA World Energy Outlook. Data Centers projected to drive about 5% of electricity demand growth. Please focus on the real challenges and tune out pundits who peddle this story. The point here is that when it comes to real changes in energy use, the bigger story is about the industrialization of the global south, and frankly, the air conditioning that comes
Starting point is 00:14:58 along with wealth growth. However, what makes data centers interesting is that we're talking about the need to grow power supply in the West for the first time in decades. So it is a big story, just not perhaps the big story that it seems sometimes. Overall, it is a super interesting moment and yet one more indicator of just how triple quadruple quintupled down the big companies are on the AI future. That's going to do it for today's AI Daily Brief. Appreciate you listening or watching as always. And until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.