The AI Daily Brief: Artificial Intelligence News and Analysis - Nvidia’s Blackwell is Radically More Powerful AI Compute

Episode Date: March 19, 2024

Nvidia's new Blackwell GPU announced at the GTC conference revolutionizes AI computing with unmatched power and innovation. As the AI industry's backbone, Nvidia remains pivotal, unveiling Blackwell: ...an AI super chip boasting 208 billion transistors and unprecedented speed. Plus the Department of Homeland Security starts a set of AI pilots. ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Breakdown, Invidia announces their new Blackwell system. Before that on the brief, the Department of Homeland Security announces its own AI pilots. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.net network for more information about our Discord, our newsletter, on our YouTube channel. Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes. One of the things that I've often noted on this show is the sort of disparity between, on the one hand, the talky side of Washington, D.C., the politicians, Congress, the Senate, etc., running a seemingly never-ending set of meetings and discussions and hearings on AI legislation without ever really doing anything
Starting point is 00:00:45 about it, as compared to the U.S. military establishment, as well as private agencies, who are just moving ahead full steam. For example, we talked about how the State Department is hosting a military conference on AI this week, and yesterday the Department of Homeland Security also announced three pilot programs around generative AI. So what are those three efforts? The first is basically to use AI to better extract information from the huge volume of investigative reports and other materials that the department creates. As Axios sums up, investigators will be able to use LLMs to more quickly summarize investigative reports and to improve the process of searching through reports. The agency hopes the pilot program will improve detection of fentanyl-related networks
Starting point is 00:01:22 and help identify perpetrators and victims of child exploitation. The second initiative is a training focused initiative that is going to attempt to use generative AI to personalize materials to help train immigration officers more effectively, and in a way that is more up to date with changing legal issues and policies. Finally, they are trying to simplify and reduce the time cost around resilience and disaster planning with FEMA, making it easier, for example, for communities to submit grants and gain funding in these areas. The New York Times characterized Homeland Security as embracing AI and says, the agency will be the first in the federal government to roll out a comprehensive plan to integrate the technology into a variety of uses. Said Alejandro Mayorkas,
Starting point is 00:01:58 the Secretary of DHS, one cannot ignore it. And if one isn't forward leaning and recognizing and being prepared to address its potential for good and its potential for harm, it will be too late. And that's why we're moving quickly. So how big is the scope of these initiatives? Well, for some reference point, DHS employs 260,000 people for these new AI pilots, they're planning to hire 50 AI experts and spend $5 million overall. Meaning, in other words, that these are relatively small efforts. However, I think the fact that they are doing a full court press strategy around this indicates where it sits in terms of a priority. As part of the initiative, they'll be working with OpenAI, Anthropic, and Meta, and will also use Microsoft, Google, and Amazon Cloud in these pilots.
Starting point is 00:02:36 Basically, it sounds to me like they're testing everyone credible to see whose tools are the most helpful. Importantly, this is not something that's meant to be drawn out. The agency has to report on the results of the pilot programs by the end of this calendar year. I think it will probably not surprise you that my very strong assumption is that we're going to see a lot more efforts like this coming from basically every area of government. One additional area of overlap between the AI industry and the U.S. government may be in combating spying and cybersecurity issues. Another piece from Axios today is called insider threats are AI developers next hurdle.
Starting point is 00:03:08 They write, AI developers hiring quickly to keep pace with market demand are struggling with a new threat, spies and employees looking to steal company secrets. USAI companies are likely already prime targets for nation-state adversaries espionage campaigns. Experts predict that AI developers could become even bigger threats than shipping. manufacturers and biotechnology companies. Of course, we recently covered the Justice Department's indictment of a former Google Software Engineer for stealing AI secrets and sharing them with two Chinese companies. Seems to me like as the U.S. government gets more interested in this area, this could be a point of overlap for these companies. One more vaguely U.S. government-related story. The United States
Starting point is 00:03:42 Securities and Exchange Commission has recently at least partially moved on from its incredulity around the cryptocurrency industry to focus on warning of AI-related scams. Reuters reports that the SEC has now find two investment advisors over their AI claims. The SEC said that Toronto-based Delphalia Inc. and San Francisco-based Global Predictions Inc. agreed to pay a combined $400,000 in fines to settle civil charges related to AI washing. Wrights Reuters, the SEC found that from 2019 to 23, Delphia made false and misleading statements in SEC filings, a press release and on its website, over its purported use of AI and machine learning. Global predictions did the same in 2023 on its website and in social media. As part of the settlement, neither company was forced to
Starting point is 00:04:21 admit or deny the SEC's charges. Given the small amount of the fines and the nature of the settlement, to me it just looks like a ratcheting up of the warning shots from the SEC around other investment advisors throwing around AI language too loosely. Over in Big Techland, YouTube has announced a new tool that will allow creators to self-label when their videos contain AI-generated material. Basically, this is a new optional checkbox that comes up as creators are uploading and posting their content, which asks them to disclose whether there is altered or synthetic content that seems realistic. Examples given include making a real person say or do something they didn't, altering footage of real events and places, or showing a, quote, realistic looking scene that didn't
Starting point is 00:04:56 actually happen. The flip side is that they are distinguishing for things like beauty filters, special effects, and clearly unrealistic content like animation. Right now, these disclosures are voluntary. Finally, today, an interesting chart on which jobs are most likely to be impacted by AI, with the source being a World Economic Forum report called Jobs of Tomorrow. At the top of the heap, that report found that 73% of IT tasks could be automated or significantly altered, 70% of finance tasks, 67% of customer sales, 65% of operations, 57% of HR, 56% of marketing, 46% of legal, and 43% of supply chain. Of course, one of the big questions, when it comes to how AI does ultimately impact jobs, is whether the disruption will be on the task level or on the actual job level.
Starting point is 00:05:42 In other words, will this change what people do, or will it straight up for place people or some combination thereof. This will obviously continue to be one of the most important questions in terms of society's relationship with AI, but for now, that is going to do it for the AI Breakdown. Up next, the main AI breakdown. Welcome back to the AI breakdown. It's hard to overstate how unique Nvidia's position in the AI industry is. It's rare, first of all, to have a company so singularly associated with a new technological revolution, particularly an infrastructure company like Nvidia. But there's no denying that when people think of AI right now, the only company that might have some more mindshare mental association is OpenAI because they were
Starting point is 00:06:28 the ones who introduced ChatGBT, BT, which got this whole party started. Still, when it comes to the world of mainstream finance and just the mainstream in general, Nvidia is probably the company most associated with the generative AI transformation. It has over the course of the last year become the third most valuable public company, and frankly it shows no signs of slowing down. This week, Nvidia is holding its GTC conference. This is an event that used to just be for hardcore computer engineers and for Nvidia customers, but now has taken on the trappings of a festival. Extending that analogy, the headline act yesterday was, of course, CEO Jensen Huang,
Starting point is 00:07:02 whose announcement of Nvidia's new GPU platform Blackwell did not disappoint. Zai Rahul sums up. Invidia just announced Blackwell, the most powerful GPU in the market. Main features? AI Superchip, 208 billion transistors. second-generation transformer engine, fifth-generation NVLink, RAS engine, 100% in-system self-test, secure AI full-performance encryption, decompression engine 800 gigabytes a second. That all sounds like Greek to you. Don't worry because the upshot comes to the next line when Si writes, analysts estimate this could
Starting point is 00:07:31 potentially be 10 to 100x faster than Nvidia's current hopper slash A100 GPUs for very large transformer model workloads requiring multi-GPU scaling. This represents a monumental leap in scale for accelerating the trillion parameter AI future Nvidia is targeting. In Jensen's presentation, he showed one chart in particular that just did a great job of showing the exponential growth of AI compute. The headline reads 1000x AI compute in eight years and goes back to 2016 when Nvidia introduced Pascal at 19 terraflops. The next year, 2017, saw Volta with 130 taraflops. 2020 ampere comes with 620. Hopper, the next great update, what we've been working on, 22, with 4,000 tarflops, and now Blackwell with 20,000 tarflops. CEO Jensen really summed it up very succinctly when he said in this keynote, Hopper is fantastic,
Starting point is 00:08:21 but we need bigger GPUs. And if the compute capacity of Blackwell is a lot of the initial focus, NVIDIA is also clearly trying to tell a different story about its future, right? CNBC. See, Nvidia executives say the company is becoming less of a mercenary chip provider and more of a platform provider, on which other companies can build software. It said Jensen Huang, Blackwell's not a chip, it's the name of a platform. Invidia's enterprise VP Manavir DOS extended this conversation. The sellable commercial product was the GPU, and the software was all to help people use the GPU in different ways.
Starting point is 00:08:52 Of course we still do that, but what's really changed is we have a new commercial software business now. The software that they introduced was called NIM, and the basic idea is to make it easier for people to deploy artificial intelligence applications. Again from CNBC, VP DOS said NVIDIA's new software will make it easier to run programs on any of NVIDIA's GPUs, even older ones that might be better suited for deploying but not building AI. Said DAS, if you're a developer, you've got an interesting model you want people to adopt. If you put it in a NIM, we'll make sure that it's runable on all our GPUs so you reach a lot of people. Now, if you are interested in
Starting point is 00:09:23 some of the more technical parts of the announcement, there's tons that you can go find on that. For example, in his presentation, Jensen explained how the Blackwell GPU is the first chip to combine two separately manufactured dyes into one chip. There were, of course, a lot of other announcements from this presentation as well. We're going to talk specifically about Omniverse, which is software that creates digital twins of real-world items, and is slated to come to Apple's Vision Pro, as well as Project Groot, which is infrastructure for humanoid robots. First, though, I want to talk about the market reaction to this, because again, part of what
Starting point is 00:09:52 has put Nvidia at the center of the conversation is the fact that it has become the standard bearer for AI stock performance. On the one hand, as Bloomberg points out, the announcement of these new trips was widely anticipated, which, quote, made it hard for the presentation's details to impress investors, who sent the shares down about 1.6% in pre-market trading on Tuesday. Bloomberg gets it part of why that might be. They write, for all its success, NVIDIA revenue has become highly dependent on a handful of cloud computing giants, Amazon, Microsoft, Google, and Meta.
Starting point is 00:10:19 Those companies are pouring cash into data centers aiming to outdo their rivals with new AI-related services. The challenge for NVIDIA is broadening its technology to more customers. And that, of course, is where we get some of this additional focus on Nvidia becoming a platform and its software business. And yet, if the immediate response of investors was not to send Nvidia stock to the moon, there is still a sense of long-term strength. For example, Goldman Sachs raised their price target to 1,000, saying we came away from the keynote with renewed appreciation of Nvidia's unique ability to innovate at data center scale, their large ecosystem in breadth of its customer and partner engagements, and ultimately compelling position is one of the key enablers and beneficiaries of the ongoing buildout of the generative AI infrastructure.
Starting point is 00:10:57 Even more than just Nvidia alone, this week we've been seeing the same back and forth that we had throughout last year where macro wobbles compete with AI enthusiasm to reign supreme in terms of how Wall Street is thinking about the markets. Indeed, Reuters wrote a piece yesterday called Wall Street ends higher. Investors juggle fed nerves with AI enthusiasm. Investors, they wrote, were torn between enthusiasm about the prospects for AI on the technology sector and worries about the Federal Reserve's policy update on Wednesday. Said Lindsay Bell, chief strategist at 248 Ventures, This is a market that really wants to hold on to the momentum trade, but what's weighing on investors' minds is what happens with the Fed this week.
Starting point is 00:11:32 Let's talk briefly, though, about some of these other parts of the announcement. Jeremy Dalton, the head of immersive technologies at PWC writes, Nvidia just announced it as bringing its Omniverse platform to the Apple Vision Pro, allowing complex 3D assets to be streamed directly to the headset for further design, assisted by generative AI. By doing this, you don't need to worry about storage or be constrained by the device's processing power. VR filmmaker and YouTuber Hugh Howe writes, Omniverse shows the capabilities of spatial computing by blending 3D photoreal environments with the real world.
Starting point is 00:11:59 Bilal Situ is at the event and writes, I got to try this demo in person today. Mind blown. Immaculate streaming quality, full-res CAD models with real-time ray tracing. Billowell shared the same demo video, which for those of you listening to the podcast, involves modifying a car. And when someone asked you were able to go in the car and still see the room around you, Billowal responded, yes, there's a mixed reality mode and it was surprisingly good. Perhaps even more exciting to many,
Starting point is 00:12:23 with something called Project Groot. Dr. Jim Fan, who I often quote on this show, writes, Today is the beginning of our moonshot to solve embodied AGI in the physical world. I'm so excited to announce Project Grute, our new initiative to create a general purpose foundational model for humanoid robot learning. The group model will enable a robot to understand multimodal instruction such as language, video, and demonstration, and perform a variety of useful tasks. We're collaborating with many leading humanoid companies around the world so that Groot may transfer across embodiments and help the ecosystem thrive. Announced in Jensen's keynote, Project Grute is a cornerstone for the Foundation Agent Roadmap of the newly founded Gear Lab. At Gear, we are building
Starting point is 00:12:57 generally capable agents that learn to act skillfully in many worlds virtual and real. Nate Barcy tried to sum up, Grut will enable a robot to understand multimodal instructions like language, video, and motion. Very soon we will see them cooking, preparing coffee, in supermarkets, changing tires, etc. So what makes something like Project Groot different than, for example, the figure O1 robot that we've talked about recently, is that this is meant to be a system for many different robots, not just one humanoid robot that Invidia is building. So it's coming at the problem in a very different way, assuming a future in which there are many different types of humanoid robots. One additional note on Grute, Invidia also announced a dedicated chip called the Jets and Thor
Starting point is 00:13:33 chip that's specifically designed for humanoid robots. Writes Venturebeat, to make sure humanoid robots can run complex multimodal models like Grut, Nvidia has launched the Jets and Thor computing platform for humanoids. Based on the company's Thor SOC, the computer includes a high-performance CPU cluster, and next generation GPU based on the Nvidia Blackwell architecture, with a transformer engine delivering 800 terraflops of 8-bit floating point AI performance. I think if you zoom out, one of the remarkable things about this really is just how unusual it is to have a hardware company like this,
Starting point is 00:14:02 not even a consumer hardware company, but an infrastructure company, be so frankly hot and at the center of a transformational technology. It's a fascinating thing to watch and something that doesn't show any signs of slowing down. If and as there are more interesting announcements from GTC, I am sure we will be covering them, but for now, that is going to do it for the AI breakdown. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.