@HPC Podcast Archives - OrionX.net - HPC News Bytes – 20260601

Starting point is 00:00:04 Welcome to HPC Newsbytes, a weekly show about important news in the world of supercomputing, AI, quantum computing, and other advanced technologies. Hi, everyone. Welcome to HPC Newsbytes. I'm Doug Black, and with me is Shaheen Khan. Huawei claimed last week at an I-Tripple E event in China, what Tom's hardware described as, quote, a sanctions-busting breakthrough, unquote, of being able to produce 1.4 nanometer class chips by 2031, with 55% higher transistor density. This is Huawei's new logic folding chip architecture that the company says can bypass restrictions

Starting point is 00:00:44 on China's access to extreme ultraviolet chipmaking machines those produced by the Dutch company, ASML. Sheen, I'm very interested in your thoughts on Huawei's claim that their Tao scaling law, T-A-U, will replace Moore's law. Is there validity here, do you think? Well, as usual, it helps to see this through the lens of Huawei, the company, and also China, the country. The big announcement and stage presentation is consistent with what we said a few weeks ago, as China's apparent change in strategy to disclose more about their technology advances.

Starting point is 00:01:22 I would not be surprised if they started participating in the top 500 list again this year. What Huawei has done is excellent marketing. They have identified two valid and well-accepted set of strategies, namely a focus on reducing signal delays via 3D design and packaging, package them into nicely labeled brands, positioned them as industry-level laws, and provided a time frame based not on what lithography can do today, but on next-generation lithography. That is a media-friendly storyline, and it's been duly picked up and amplified. But big picture, things carry on as before, and nothing has changed.

Starting point is 00:02:05 China does not have access to advanced lithography. Advanced lithography is needed. It will not be able to develop it indigenously anytime soon and must therefore make up for it through other parts of system design. And that means holistic co-design and co-optimization across the full stack, which is what everyone else is doing too. Now, efficiency drops as the number of nodes grows and slower chips require a larger number of chips and nodes and racks for a given performance. So Huawei would need to optimize across the stack in a tougher optimization zone.

Starting point is 00:02:43 It's usually a bigger and harder task than what you have to do with clusters based on faster chips, but it's been done before. When you look at the entire software stack and algorithms for learning and inference and apps on top of them, especially with today's learning algorithms that rely on heavy communication and shared memory constructs and contrast that with what the hardware needs to do, advances in those softer areas will be more imminent, and of course that's something that the entire industry is pursuing as well. At the chip level, another path for China could be wafer scale chips. That would reduce the number of nodes required for a given overall performance. The Chinese Academy of Sciences has a development project in that area, and that may very well feed into Huawei, but it is not discussed in this announcement.

Starting point is 00:03:34 There are a lot of problems to be solved for wafer scale chips that we have discussed on this podcast before, like connectivity among different parts of the wafer, heat distribution and structural integrity, yield management and routing around failures, etc., etc. It took Cerberus and TSM years to solve them, so that is a difficult path, too. The AI landscape is changing fast, and with it, the compute needs and sources of compute. News broke last week that Cloud Data Storage Company's Snowflake signed a $6 billion five-year deal with Amazon Web Services for AWS AI inference chips, i.e. Gravaton, arm-based CPUs. The deal in part is due to the large amount of corporate data housed in Snowflake Storage on AWS. It also reflects the shift in the AI market toward inference, the move from training, then to daily usage, and now to automation via AI agents,

Starting point is 00:04:36 which means CPU usage is exploding. The announcement came alongside a blowout Q1 earnings report for Snowflake, which beat revenue estimates and raised its full year guidance, which led the stock to surge by 30, Now, in its fifth generation, Graviton 5 can provide 192 cores and deliver some 25% more performance than Graviton 4. Not surprisingly, AWS's Andy Jassy says their CPUs offer plenty of compute at lower price, and apparently, Snowflake agrees. The announcement is not quite just the subscription renewal spun by the vendors, but it is positioned well and spun well nevertheless.

Starting point is 00:05:19 it indicates massive demand for infrastructure and includes CPUs and GPUs, but with heavy emphasis on CPUs. So it simultaneously indicates growth for the companies, shows further adoption of AWS' homegrown graviton CPUs, and sends a signal that going forward, GPUs are maybe not as critical as they have been. In reality, market demand for all technologies is growing and the return of CPU-based computing is a welcome release valve. As for the size of the deal, for the full year 2025, AWS generated $128.7 billion in total revenue. By the end of Q4 2025,

Starting point is 00:06:03 AWS's growth was 24% year over year, pushing its annualized revenue run rate to $142 billion. So $6 billion over five years is obviously huge and one of the biggest deals of its kind, but numbers are big in all sorts of places these days. Snowflake runs on Azure and Google Cloud too, but AWS is the main provider. It was also reported that Snowflake had generated some $7 billion through the AWS marketplace over its history and customers' own spending through AWS on Snowflake,

Starting point is 00:06:39 presumably ramping up fast over the years, also would likely count. And given that software is sticky and many of those customers would continue, and others would join them, the deal seems to be a calculated projection. The Snowflake has also announced multi-year deals with Anthropic and Open AI on the order of $200 million each, along with collaborations with meta-AI and mistral, the French AI company.

Starting point is 00:07:05 And given AI's rapid growth, those deals could grow as well. As we're seeing, the surge in AI inference has led to more companies producing new chips to compete in the CPU market, and that includes the inventor of GPUs. We're talking, of course, about NVIDIA. CEO Jensen Wong announced earlier this month the intent to grow NVIDIA's CPU presence with a chip called VARA, designed specifically for AI. The new VERA CPU rack integrates 256 liquid-cooled Veras and can sustain more than 22,500 concurrent CPU environments. AI factories can quickly deploy and scale to tens of thousands of simultaneous instances and agentic tools in a single rack, according to the company.

Starting point is 00:07:53 And the company said the chip delivers results with twice the efficiency and 50% faster than traditional CPUs. Well, we've been covering the return of CPU-based computing for a while now. A few weeks ago, we described the growing importance of CPUs for orchestration of agentic AI and their on-chip accelerators and therefore use and inference to cut costs. We also talked about China's CPU-only supercomputer a few weeks ago. And we have described Fujitsu's Fugaku and fugu.net next at length with the upcoming Monaco CPU and ability to use GPUs 2 when needed.

Starting point is 00:08:34 So CPU computing is coming back. And fortunately for Invidia, it's taken it long enough that Nvidia can play. It will be a competitive market that includes Intel and AMD, of course, but also Arm itself with a chip they called AGI, no less, an ampere, which seems to be focusing on large deployments, and established homegrown CPUs by AWS, Microsoft, and Google, and then Risk Five players like Sci-5, Ventana, Esperanto, SoftGo, and Alibaba, which can do Risk Five and Arm,

Starting point is 00:09:07 especially through its chip design subsidiary T-Head, T-Hash, head. We'll close with the Wall Street Journal report last week about an open AI model, cracking a well-known math problem, well-known in math circles, that has resisted a solution for 80 years. The problem looks to me something like an optimization problem. The journal article describes it as, quote, if you put n dots on a sheet of paper, how many pairs of dots can be exactly one unit apart? The open AI model solved it and, leading mathematicians are stunned. No less than Timothy Gowers, professor at College de France, and winner of the prestigious Fields Medal, called it a milestone in mathematics.

Starting point is 00:09:54 This achievement is consistent with what renowned mathematician Professor Terence Tao of UCLA has said about the role of AI and math. Tao sees AI as a powerful tool that transforms mathematics by lowering the cost of exploration and accelerating routine workflows. AI excels at large-scale meta-searches and validating, quote, crazy ideas across infinite combinations. But he cautions that raw computer-generated proofs often miss the journey and miss important conceptual insights. And of course, AI remains prone to mistakes, including very subtle, logical hallucinations. So he sees a critical need for formal, verifiable. tools. All that would transform the mathematician to intuitive director of automated logic.

Starting point is 00:10:47 Shaheen, I'll repeat a point that's often made, which is that AI is permanently emerging into something more powerful. So if models are solving problems of this sort now, where will it be in 10 years in 25? Yeah, absolutely. Absolutely, it's coming. Well, as you all know, I believe AI delivers amazing results when A, it has information that humans gave it, but did not know they were giving it, i.e. the information content of what AI ingests is higher than what humans perceive. Or B, you need very high iteration count to incrementally build towards the result, and computers can iterate billions of times. Or C, the result requires the right combination of several things from among billions of possibilities, and AI can sift through them all.

Starting point is 00:11:35 What Open AI has done here is a notable and sophisticated version of this combinatorial approach. Now, a human would have to do what I call a, quote, perceptive shot in the dark, end quote, because the meta space of combinations is so large, the human brain has to use intuition and do some extreme filtering and focus on a few possibilities. When that works, it's genius. When it doesn't, it's a series of dead ends. This is why a quote, crazy thinker can sometimes find the answer by considering a novel unexpected combination. But what AI has done here is described as more than just a sophisticated search.

Starting point is 00:12:17 Even the most powerful supercomputers are set to be easily paralyzed by what they call the combinatorics of pure math, meaning that calculating every combination of mathematical fields is just not possible. The Open AI model was able to consider that right combination of mathematical concepts to begin with and then applied the right logic to synthesize them. It had to use something similar to learned intuition, smart filtering, assigning probabilities to eligible mathematical concepts, and then evaluate many more combinations than a human could. So AI was able to produce a very good short list, which is the impressive part,

Starting point is 00:12:58 even though that list is way, way longer than a human's short list, the chain of thought it produced was 125 pages long, covering many explorations, each of which could take a long time for a human. To paraphrase Professor Tao, AI can be richer and broader, but not deeper. All right, on that promising note, that's it for this episode. Thank you all for being with us. HPC Newsbytes is a production of Orion X, Shaheen Kahn and Doug Black host the show. Every episode is posted on Orionx.net. If you like the show, please rate and review it.

Starting point is 00:13:36 Thank you for listening.

@HPC Podcast Archives - OrionX.net - HPC News Bytes – 20260601

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.