The AI Daily Brief: Artificial Intelligence News and Analysis - Google Cloud Next is All About Agents [Shocker!]

Starting point is 00:00:00 Today on the AI Daily Brief, Anthropic introduced their power user tier. Before that in the headlines, Google Next is all about agents. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Hello, friends. Three quick notes before we dive into today's show. First up, as I've been mentioning a couple times, for those of you who are looking for an ad-free version of the AI Daily Brief, you can now head on over to Patreon.com slash AI Daily Brief to find that.

Starting point is 00:00:36 Second, next week is spring break, so you will not not have shows, but they will be a little bit different than normal. Lastly, something I want to gauge people's perspective on. The AI Daily Brief community has been hugely supportive of and important in the superintelligence story. We're considering reserving part of our current round for investors from this community. However, I'm trying to gauge interest. If this is something you think we should explore, send me a note at NLW at B-Super.a.i with super in the title. Thanks in advance for your perspective. And with that, let's get into today's show. Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes.

Starting point is 00:01:11 We kick off today with Anthropic getting themselves a power user tier. Now, you might remember when OpenAI introduced their $200 a month tier back in December. Many believe that the demand just wouldn't be there. However, while we don't know how popular exactly that subscription is, it's generated enough usage that Sam Altman said that the company was actually losing money on the deal. Anthropic is calling their version of the premium subscription Claude Max. The product allows users to pay to get around Anthropics notoriously troublesome rate limits, gets priority responses during heavier traffic periods, and early access to new features thrown in

Starting point is 00:01:44 as a sweetener. If you've spent any time on AI Twitter, you'll know that people have been constantly asking Anthropic for the ability to pay more for more service. There's actually two different levels of Claude Max. For $100, users get five times the rate limits of the $20 a month pro tier. That math works out. But for those users who really burn through tokens, they'll want the $200 level that allows for 20 times the rate limit.

Starting point is 00:02:06 What's not on offer, unfortunately, for some, is unlimited usage. So is this about boosting revenue or simply recognizing that power users are under-served? As they say in the business, why not both? Anthropics product lead Scott White said that the company isn't ruling out adding even more pricey subscriptions, saying we could see a $500 a month level. Ultimately, he said that the product roadmap is guided by user feedback. And the one loud, consistent piece of user feedback we've seen over the past year is that power users want to pay for more usage.

Starting point is 00:02:34 Next up, a little more on the tariff fallout, Nvidia has secured a carve-out for their China-based chips. Industry insiders had widely expected the administration to clamp down on export of H20 chips, which is the downgraded GPUs that are designed to get around export controls. NPR is reporting that the additional restrictions won't go ahead after Jensen Huang attended a million-dollar-a-head dinner at Mar-a-Lago last week. Sources said that restrictions had been in the works for months and were ready to be implemented as soon as this week. They said that the president changed his mind after Huang promised new data center investment in the U.S. Chris Miller at Tufts University History Professor and Semiconductor expert commented,

Starting point is 00:03:09 even though these chips are specifically modified to produce their performance, thus making them legal to sell to China, they are better than many, perhaps most, of China's homegrown chips. China still can't produce the volume of chips it needs domestically, so it is critically reliant on imports of Nvidia chips. Then again, that view is a little up in the air after recent reports of efficient training runs on new Huawei chips. However, even if the Chinese AI industry is trying to wean themselves off of

Starting point is 00:03:31 invidia, the market is still critical for the dominant chipmaker. 13% of Nvidia's official demand comes from China, and that figure could be much higher if you account for evasion of export controls through Southeast Asia. All in all, it adds up to us continuing to not have a coherent picture of the administration strategy when it comes to chip controls. Following inauguration, Trump pledged to wind back many of the restrictions on the AI industry. However, the enhanced export controls introduced in the final weeks of the Biden presidency are still in place.

Starting point is 00:03:59 These regulations put limits on a huge portion of the world, including friendly countries like Israel and India. Then again, this does also seem like reinforcement of the idea that there's always a deal to be made when it comes to Trump. Now, over the last couple of days, you may have heard me wax poetic around what I think the implications of some of this tariff stuff are likely to be on venture capital. Nominally, I think that it's going to be a harder fundraising environment, not only for startups, but also for VCs themselves, but countering that point appears to be Andreson Horowitz, who are reportedly looking to raise a $20 billion AI mega fund. sources said the firm is looking to capitalize on high international demand for investments in American

Starting point is 00:04:34 companies. They added that international LPs view the fund as a way to more easily invest money in the USAI sector without the restrictions. So it sounds like this actually might be playing into and taking advantage of some of the tariffs. Last year, A16Z raised $7.2 billion scattered across the themes of American dynamism, apps, games, infrastructure, and growth. This fund then is both significantly larger and more focused than previous efforts. The gigantic size brings brings up questions about whether venture capital can scale up in this rarefied air. SoftBank is perhaps the obvious comparison. They raised their $100 billion vision fund in 2017 with very mixed results. The second vision fund raised in 2019 is a relatively more modest but still massive $56 billion.

Starting point is 00:05:14 The other comp that brings to mind is Sequoio, who currently manage over $56 billion in assets overall. Still, there hasn't been a venture strategy as capital intensive as AI in the past. In fact, part of the reason that companies like OpenAI had to turn to big tech partners like Microsoft is that there simply wasn't enough dry powder in the venture capital coffers for them to get what they needed. Reuters sources said that a significant portion of the fund would be set aside for follow-on investments in companies already in A16Z's portfolio. And with portcos like mistral and safe superintelligence and data bricks, there is a lot of money to be spent. There is certainly a lot of capital need.

Starting point is 00:05:49 Lastly, today, a little bit of institutional psychodrama. open AI has countersued Elon Musk asking the court to bring an end to the billionaire's legal challenge. The court filing called for Musk to be prohibited from taking, quote, further unlawful and unfair action and held responsible for the damage he has already caused. It stated, open AI is resilient, but Musk's actions have taken a toll. Should his campaign persist, greater harm is threatened, to open AI's ability to govern in service of its mission, to the relationships that are essential to furthering that mission, and the public interest. Musk's continued attacks on OpenAI, culminating most recently in a fake takeover bid designed to disrupt OpenAI's future,

Starting point is 00:06:24 must cease. Musk's attorney immediately fired back. In a press statement, he said, had OpenAI's board genuinely considered Musk's bid as they were obligated to do, they would have seen how serious it was. It's telling that having to pay fair market value for OpenAI's assets allegedly interferes with their business plans. The case is currently at a slow point as the parties away to jury trial next spring.

Starting point is 00:06:45 Musk's attempt to seek an injunction to stop Open AI from converting to a nonprofit was rejected in March. So technically there isn't anything stopping California Attorney General Rob Bonser from making a decision on the conversion. However, complaints continue to roll in and the active litigation gives him a good excuse to delay. Meanwhile, of course, OpenAI has a huge financial incentive to get this wrapped up quickly. The company's latest fundraising round featured $10 billion from SoftBank that is contingent on the conversion being completed by the end of the year. That, friends, is going to do it for today's AI Daily Brief Headlines edition. Next up, the main episode.

Starting point is 00:07:16 Hey, listeners, want a supercharge your business with AI? In our fast-paced world, having a solid AI plan can make all the difference. Enabling organizations to create new value, grow, and stay ahead of the competition is what it's all about. KPMG is here to help you create an AI strategy that really works. Don't wait, now is the time to get ahead. Check out real stories from KPMG of how AI is driving success with its clients at KPMG. com.us slash AI. Again, that's www.kpmg.

Starting point is 00:07:49 Today's episode is brought to you by Vanta. Vanta is a trust management platform that helps businesses automate security and compliance, enabling them to demonstrate strong security practices and scale. In today's business landscape, businesses can't just claim security, they have to prove it. Achieving compliance with a framework like SOC2, ISO-27-01, HIPAA, GDPR, etc. is how businesses can demonstrate strong security practices. And we see how much this matters every time we connect enterprises with agent services providers at super-intelligent. Many of these compliance frameworks are simply not negotiable for enterprises.

Starting point is 00:08:23 The problem is that navigating security and compliance is time-consuming and complicated. It can take months of work and use up valuable time and resources. Vanta makes it easy and faster by automating compliance across 35-plus frameworks. It gets you audit-ready in weeks instead of months and saves you up to 85% of associated costs. In fact, a recent IDC White Paper found that Vanta customers achieve $535,000 per year in benefits, and the platform pays for itself in just three. months. The proof is in the numbers. More than 10,000 global companies trust Vanta, including Atlassian, Cora, and more. For a limited time, listeners get $1,000 off at vanta.com slash nLW for $1,000 off. Today's episode is brought to you by Super Intelligent

Starting point is 00:09:08 and more specifically Super's Agent Readiness Audits. If you've been listening for a while, you have probably heard me talk about this, but basically the idea of the Agent Readiness Audit is that this is a system that we've created to help you benchmark and map opportunities in your organizations where agents could specifically help you solve your problems, create new opportunities in a way that, again, is completely customized to you. When you do one of these audits, what you're going to do is a voice-based agent interview where we work with some number of your leadership and employees to map what's going on inside the organization and to figure out where you are in your agent journey. That's going to produce an agent readiness score that comes

Starting point is 00:09:49 with a deep set of explanations, strength, weaknesses, key findings, and of course, a set of very specific recommendations that then we have the ability to help you go find the right partners to actually fulfill. So if you are looking for a way to jumpstart your agent strategy, send us an email at agent at bsuper.a.i, and let's get you plugged into the agentic era. Welcome back to the AI Daily Brief. In a massive shock to literally no one paying any attention to this space, Google Next is all about agents. Yes, friends, this week we are headed into the next round of big tech conference season,

Starting point is 00:10:25 and unsurprisingly, Google's Cloud Next conference featured a huge lineup of AI announcements, designed to all in some, I think, make the technology feel and be more useful. The company's annual cloud conference was held in Las Vegas this week and was absolutely squarely focused on taking the next step in the AI race. We've got agentic infrastructure, new models, a new AI chip, and much, much more. A pair of announcements about agentic infrastructure could end up having the most impact of all. Back a couple of weeks ago on March 30th, in the wake of OpenAI announcing that they were going to support MCP or the Model Context Protocol, Google CEO Sundarpa Chai tweeted, to MCP or not to MCP,

Starting point is 00:11:03 that's the question. Let me know in the comments. 1.8 million views and a thousand comments later, we got the answer and it was a yes. DeepMind CEO Demis Sabaas tweeted, MCP is a good protocol and it's rapidly becoming an open standard for the AIAgent era. We're excited to announce that we'll be supporting it for our Gemini models and SDK. Look forward to developing it further with the MCP team and others in the industry. And to reiterate, this means that all three of the leading US labs are now supporting MCP. Now, for those of you who don't know what the heck I'm talking about, we covered MCP in depth a couple weeks back.

Starting point is 00:11:34 But in short, if you're just trying to understand the implications, it means a significant boost for interoperability and compatibility across the rapidly developing agentic infrastructure layer. The more people's supporting and building on MCP, the more agent building becomes plug-in-play, and the faster everyone builds on the advances of everyone else. The second big agentic announcement was the unveiling of Google's agent development kit and a new interoperability standard called agent-to-agent. As the name suggests, the standard is seeking to harmonize the way agents communicate with each other rather than how they interact with tools. Rouse, who are up in any, VP of Google's

Starting point is 00:12:07 cloud business application platform, insisted that A2A is not competing with MCP, which of course is more about tool use. They said, we see MCP and A2A as complementary capabilities. The way we are looking at agent to agent is at a higher layer of abstraction to enable applications and agents to talk to each other. So think of it as a layered stack where MCP operates with the LLMs for tools and data. Words, words, words, but basically the point is that they're trying explicitly to not compete with MCP here, but instead be a standard for something different. A2A has 50 companies on board to support the Open Standard, including Salesforce, Service Now and Workday. Surrapanini said that Google isn't necessarily looking to compete with other consortiums working on their own solutions,

Starting point is 00:12:46 saying, we will look at how to align with all of the protocols. There will always be some protocol with a good idea, and we want to figure out how to bring all those good ideas in. Now, the benefits of this sort of standardization are replete. Standard ways for agents to coordinate could reduce the amount of complexity, for example, in multi-agent systems, could be mean that agents talk to their counterparts at other companies, making them more capable of getting work done without getting humans involved. So what does it all amount to? Well, MIT PhD Tobin South says, my take on A2A from Google and friends is that they're trying to create a communications hierarchy with MCP as tool use and A to A as coordination and communication. Frankly, I prefer the client server

Starting point is 00:13:20 model of MCP, and I think we'll see A2A wrapped by an MCP server supporting the A2A schema. HubSpot founder and now agent.aI creator Darmes Shah writes, shockingly, this doesn't change everything. It's very, very early, but here are my initial thoughts. I'm a big believer in multi-agent networks and agent-to-agent communication. It's good that there's now an open standard out there for it. They cover some very key needs. Capability discovery, agents being able to send messages to each other,

Starting point is 00:13:45 being able to work on tasks that are long-lived to async, weaving in human U.X into the agentic flow, etc. This is not a replacement for MCP. In fact, during their announcement post, Google included a helpful diagram that illustrates how A-2A and MCP fit together. Still, this feels a bit heavy to me. It's trying to do a lot. In a way, that's good because you get a bunch of capabilities out of the box like async tasks and user experience negotiation, but the trade-off is that heavier protocols are harder to implement, and as such, you don't get the quick adoption you see with

Starting point is 00:14:11 lighter weight things, so I don't anticipate MCP-style adoption. Reading between the lines, this feels like they're solving for a lot of mega-enterprises and big consulting firms looking to build multi-agent systems inside the corporation. It's less about connecting agents across orgs, but I could be wrong. will be interesting to see actual usable implementations of this outside the Fortune 1000 companies. Overall, this is good news, though, moves us further down the multi-agent systems road. Other random little agentic notes, Gemini Code Assist is getting an agentic upgrade? This is their cursor competitor nominally, and it can now deploy agents to complete complex programming tasks across multiple steps.

Starting point is 00:14:47 This kind of agentic feature has been a game changer on other platforms, with programmers using it to automate repetitive tasks like code migration. Google has also released a security. agent as part of their new unified security platform. The goal is to have an agent on the beat that can recognize and remediate threats before they become major problems. The chief information security officer at Charles Schwab, Bashar Abasido said, Google is transforming security operations and enabling our vision to stay proactive in responding to cyber threats. The platform has empowered our team to focus on strategic initiatives and high-value work. We didn't get a massive new model,

Starting point is 00:15:19 but what we did get was Gemini 2.5 Flash. Like its predecessor, this is a smaller model designed to deliver efficient performance with extremely low latency. The big change is a lot more customization. Google wrote, you can tune the speed, accuracy, and cost balancing for your specific needs. This flexibility is key to optimizing Flash performance in high volume cost-sensitive applications. Gemini 2.5 Flash is a reasoning model at launch and will likely end up being the cheapest on the market. The model is designed to adjust the depth of reasoning based on the complexity of the prompt, which is similar, it seems, to the approach that OpenAI is pursuing. Google is aiming to provide a model that hits the sweet spot between performance and cost writing.

Starting point is 00:15:57 It's the ideal engine for responsive virtual assistance and real-time summarization tools where efficiency at scale is key. Google also announced that they plan to bring Gemini models to on-premise deployments starting in Q3. On the chip side, Google has announced the seventh generation of their tensor processing unit, which they're calling ironwood. Now, unlike GPUs, TPUs are specifically designed for AI computing tasks. GPUs, meanwhile, are more generalized across all mathematical functions. In fact, in the early days, it was a slight coincidence that the chip architecture that powers 3D gaming was also highly performant for AI. The core bet with developing TPUs instead of GPUs was that specially designed architecture

Starting point is 00:16:34 would be more efficient. So far, that theory hasn't totally played out with NVIDIA's GPU architecture still at the top of the pack. However, Google is hoping that Ironwood might finally validate that thesis. The company claims that their new processor can deliver 24 times the computing power of the world's fastest supercomputer when deployed at scale. Previous generations of the hardware were designed for both training and inference, but Ironwood is the first to be specially optimized for inference.

Starting point is 00:17:00 Amin Vodat, Google's vice president of machine learning said, Ironwood is built to support this next phase of generative AI and its tremendous computational and communication requirements. This is what we call the age of inference, where AI agents will proactively retrieve and generate data to collaboratively deliver insights and answers, not just data. Now, there are a bunch of numbers and specs to come with this, this thing, but rather than try to explain an exa-flop, the promise here is that Ironwood is around

Starting point is 00:17:23 twice as fast as Nvidia's H-100. Still, they argue that the biggest difference is actually scale. A maximum-sized pod of Blackwell B-200 chips is 576 chips before they require outside networking, while Ironwood claims to be capable of being deployed in a 9,216 chip pod. This is also a massive jump from Google's previous generation of TPUs, which were called Trillium. The company has achieved a fourfold increase in computing power compared to the 2024. model. Ironwood is also more efficient delivering twice the performance per watt compared to Trilium. And this TPU and the focus on efficient inference definitely suggests that Google is scaling up to service compute-hungry reasoning models and the agents driven by them. Now, there is more

Starting point is 00:18:02 going on here as well. In a big vote of confidence for Google's new Silicon, Iliu announced that his startup will use Google Cloud's TPUs. Samsung announced that Gemini would be added to their new home robot. Google's Enterprise Cloud platform now features a music generation model. Ultimately, while there wasn't one big, huge thing, at least not a hit you over the head, overall it's a pretty remarkable shift from black Nazis and glue on pizza to all of this in just about a year. AI for success writes, Google DeepMind is destined to win the AGI race, and here's why. They have the data advantage. They own the TPUs. Massive distribution channel. Have all the best models right now. They have everything, and they dominate all four key areas.

Starting point is 00:18:41 Applications, foundation models, cloud, accelerator hardware. I'm not sure how this all shakes out, Google, you've got to be happy that that is the narrative among many these days. That's going to do it for today's AI Daily Brief. Appreciate you listening or watching as always. And until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - Google Cloud Next is All About Agents [Shocker!]

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.