The AI Daily Brief: Artificial Intelligence News and Analysis - AI Skepticism is CANCELLED

Starting point is 00:00:00 Today on the AI Daily Brief, why AI skepticism is officially canceled. Before that in the headlines, AI moves to production mode. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. Hello, friends, quick announcements before we dive in. First of all, thank you to today's sponsors, robots and pencils, Superintelligent, and Vanta. To get an ad-free version of the show, go to patreon.com. Where ad-free starts at just three bucks a month. And of course, to learn about sponsorship opportunities, shoot us in

Starting point is 00:00:34 note at sponsors at AIdailybrief.aI. We will send you all the goods, and you can reach this excellent audience of AI practitioners. With that, though, let's dive in. Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes. The theme of today, at least where we're kicking off, is AI moving into production mode. One of the big things that I think is happening right now is that all across the industry, we are moving from pilots and experiments, and more than that as activity, that as mindset, into thinking about what is it going to take to bring these technologies to production for big, extensive, important use cases. We're going to talk about that a little bit in the context of agents and how long they can be autonomous in the main episode,

Starting point is 00:01:15 but we kick off with that theme today with the headlines as well, with OpenAI, who have now added full support for MCP tools within ChatGBT. The company wrote, we've finally added full support for MCP. In developer mode, developers can create connectors and use them in chat for right actions, not just search and fetch. Update Jira tickets, trigger Zabior workflows, or combine connectors for complex automations. Now, on some level, this is a small story. This was always coming. OpenAI had already committed to this full support, but it now exists. It's a meaningful improvement to the U.S. And it means users are going to have access to the full suite of connectors and workflows, regardless of how they access these models. The practical effect of that is going to be more

Starting point is 00:01:53 real use cases moving to production. Next up, YouTube is rolling out their AI audio dubbing feature to millions of creators on the platform. Now, this feature has been going through pilots since 2023, including testing with large creators like Mr. Bees, Mark Robere, and Jamie Oliver. Video can be automatically dubbed into dozens of languages, including Hindi, Korean, and Portuguese. With two years of data, YouTube has reported incredible results in delivering English-language content to the globe.

Starting point is 00:02:19 They claim that on average, creators uploading multilanguage audio have seen more than 25% of watchtime come from views in other languages. Jamie Oliver's cooking content saw a 3x viewership boost by making it available in multiple languages. YouTube's next pilot is testing automated thumbnail translations to make that content a more natural fit in other languages as well. Now, between this rollout and Apple's built-in translation for the new AirPods,

Starting point is 00:02:41 it definitely feels like multi-language AI is hitting production quality. The implied cost reductions behind this broad rollout are also notable. In 2023, when the technology was being developed, it may have only made sense to deploy it for videos that get millions of views. Now it is going to be available to millions of smaller creators. Hopefully soon you'll be able to listen to the AI Daily Brief in Spanish, Arabic, or Russian. The Enterprise also continues to get production-grade tools. Stability AI, for example, has released a new version of their audio model, which they say is up to the task of enterprise-grade sound production.

Starting point is 00:03:13 They write, Audio influences brand engagement by 86%, but few enterprises are leveraging audio as an extension of their brand, making customized sound an untapped differentiator. The company's stable audio 2.5 model is capable of generating full songs within seconds. It also has a feature the team is called audio in painting, where users can upload a few bars of a track and let the model fill in the rest based on that context. Stability is also offering to fine-tune the model so companies can dial in the right sound for their brand. Zach Evans, head of audio research at Stability said, 2.5 isn't just an iteration on 2.0. It reflects our shift towards enterprise-grade capabilities, professional quality audio, faster performance,

Starting point is 00:03:51 and the advanced control needed for commercial use cases and the multi-step iterative workflows of creative professionals. Now, control seems key, and this is the same evolution we've recently been seeing with image and video generation. Creative AI workflows have been moving from essentially trying to get lucky with a one-shot generation to iterative AI editing, allowing users to hone in on what they need. All of this adds up to, I think, what we are moving into a much more production-focused era of AI. Now, in this particular set of headlines, there were a bunch of things outside this theme as well, including a couple stories on the regulatory front.

Starting point is 00:04:23 Texas Senator Ted Cruz, perhaps unsurprisingly to you, wants to keep the government's hands off of AI development. On Wednesday, he introduced a new bill called the Sandbox Act. The bill would require the White House Office of Science and Technology policy to create a sandbox for AI model testing with minimal regulatory standards. Companies that participate in the program could then release AI products under regulatory waivers. Now, notably, this is not just about exempting companies from AI regulations. The government could move any regulatory barrier as they see fit. For example, a company working on AI cancer screening software could seek an exemption from HIPAA laws that protect patient privacy. Waivers could be granted for two years at a time or for up to 10 years in total.

Starting point is 00:05:01 The bill is part of a five-pillar plan that Cruz is set to introduce during Wednesday's committee hearing, Cruz said, a regulatory sandbox is not a free pass. People creating or using AI still have to follow the same laws as everyone else. Now, the process of actually getting to federal regulation is going to be extremely controversial. The verge, for example, characterized the bill as letting AI companies set their own rules for up to 10 years, and while that's probably a bit strong, it's going to represent a pretty prominent opinion on how the Sandbox Act would play out in practice. In the meantime, as the government figures out what it's going to do with AI regulation, there are many efforts at some sort of self-regulation as well. One recent example is that a group of

Starting point is 00:05:37 leading web publishers have announced a new standard for content licensing for the AI-first internet. Called really simple licensing or RSL, this new standard builds on top of concepts like robots.t.TXT and the RSS standards for content feeds. A new nonprofit called the RSL collective will be the steward for the standard, with support from Reddit, Yahoo, People, Quora, Medium, and O'Reilly. Said Tim O'Reilly, RSS was critical to the Internet's evolution as an information ecosystem, giving early online publishers a simple open standard to syndicate their content and reach audiences at Internet scale. That spirit of openness is what helped the web thrive. Today, as AI systems absorb and repurpose the same content without permission or compensation,

Starting point is 00:06:15 the rules need to evolve. RSL builds directly on the legacy of RSS, providing the missing licensing layer for the AI-first Internet. It ensures that the creators and publishers who fuel AI innovation are not just part of the conversation but fairly compensated for the value they create. The idea of the new standard is to allow Robots.txti to go beyond simple yes-no permissions for web scraping to define a new automated licensing layer for the internet as a whole. The RSL collective writes, RSL, services supports a range of licensing, usage, and royalty models, including free, attribution, subscription, paper crawl, and paper inference, i.e. publishers get compensated

Starting point is 00:06:49 every time an AI application uses their content to generate a response. There are now two distinct approaches to solving the issue of AI crawlers serving up content for free, with the other being offered by Cloudflare. Cloudflare system includes a proprietary licensing marketplace that handles the required micropayments and has been pretty controversial. The RSL collective is betting on AI companies opting into a well-organized standard instead. Now, obviously, I think this is the beginning and at the end of a conversation, but shows how groups are trying to figure out how to solve some of the potential negative externalities of AI without waiting around for the government to solve it for them.

Starting point is 00:07:22 Lastly today, one more on the fundraising front, Perplexity has officially closed their latest round of funding at a $20 billion valuation. The information reports that Perplexity raised $200 million in fresh capital. The new round comes shortly after the company's previous round in July, which saw $100 million raised at an $18 billion valuation. Sources also said that the company is approaching $200 million in ARR, up from $150 million in August. I.e., the funding environment for leading AI startups is still red-hot and growth isn't showing any signs of slowing down, which I think is a perfect segue to our main episode, why AI skepticism is officially canceled.

Starting point is 00:07:58 AI changes fast. You need a partner built for the long game. Robots and pencils work side by side with organizations to turn AI ambition into real human impact. As an AWS certified partner, they modernize infrastructure, design cloud native systems,

Starting point is 00:08:12 and apply AI to create business value. And their partnerships don't end at launch. As AI changes, robots and pencils stays by your side, so you keep paying. The difference is close partnership that builds value and compounds over time. Plus, with delivery centers across the U.S., Canada, Europe, and Latin America, clients get local expertise and global scale. For AI that delivers progress, not promises, visit robots and pencils.com slash AI Daily Brief. As a founder, you're moving fast towards product market fit, your next round, or your first big enterprise deal.

Starting point is 00:08:44 But with AI accelerating how quickly startups build and ship, security expectations are higher earlier than than ever. Getting security and compliance right can unlock growth or stall it if you wait too long. With deep integrations and automated workflows built for fast-moving teams, Vanta gets you audit-ready fast and keeps you secure with continuous monitoring as your models, infra, and customers evolve. Fast-growing customers like Langchain, writer and cursor trusted Vanta to build a scalable foundation from the start. And look, as someone who lives in the world of enterprise procurement, I love how Vanta makes it easy to get compliance right. The last thing you need when you're trying to win that big deal is to have it scuttled by something that Vanta has solved for over 10,000

Starting point is 00:09:23 companies. Go to Vanta.com slash NLW to save $1,000 today through the Vanta for Startups program and join over 10,000 ambitious companies already scaling with Vanta. That's VANTA.com slash NLW to save $1,000 for a limited time. If you are a regular listener, you will have heard about superintelligence agent ratingness audits at this point. But I wanted to tell you today about the full suite of agent readiness products that go beyond just the initial readiness report. Over the last six months, Super Intelligence has built out an entire agent planning suite. We help you move from discovery to planning to implementation. After you've completed your agent readiness audits, we help you double-click on your most

Starting point is 00:10:06 important use cases with what we call our use case planning reports. These reports are going to help you understand what sort of technical preparation you need to do to be ready for a use case, what challenges you might face in implementation, and whether you should be thinking about building, buying, partnering, or some combination. After that, you can even get a spec document in what we call our technical blueprint that gives either your developers or the developers of the partner you work with what they need to build exactly the agent that you're looking for. If you want to learn more about superintelligence agent planning suite, we've built a custom GPT to answer your questions. Just go to bit.ly slash super super super agent.

Starting point is 00:10:41 That's bit.it.l.ly slash super super agent, all one word. And if you have any questions, questions, the agent can even help you book an appointment with our team. Welcome back to the AI Daily Brief. For weeks now, there has been this skepticism narrative surrounding AI. It has had multiple dimensions, obviously the MIT story, the launch of GPT5, and on Wall Street it's been particularly pronounced. As I've shared, I think at least when it comes to the market dimension of this, a lot of it has to do with AI taking on the generalized anxiety of the larger stock market. But boy, yesterday on Wall Street, AI, skepticism was forcibly removed after Oracle put in one of the most remarkable trading days

Starting point is 00:11:23 in recent memory. Now, we talked already about Oracle's big announcement. On yesterday's show, we covered how they had reported a backlog of $455 billion in contracts over the next five years. The stock was up 27% overnight after the announcement and reached as much as 43% up, ultimately landing at 36% up to reach a new record high. Value at around $600 billion at the beginning of the week, Oracle is now almost a trillion-dollar company. So what happened? This was not a story of Oracle wildly beating analyst estimates. In fact, it was a small miss. Instead, this was all about the future. It was about that $455 billion, in other words, in contracts over the next five years. By the end of the day, we got more information around who was accounting for all of that demand.

Starting point is 00:12:08 Specifically, we learned that OpenAI was the mystery customer who had contracted for $300 billion last quarter. The deal begins in 27 and will run over five years. The contract requires Oracle to deliver 4.5 gigawatts of compute. This is one of the single largest cloud contracts in history and will require the equivalent of two Hoover dams to supply the power. Now, one interesting narrative shift here is the implications of Oracle chairman and founder Larry Ellison becoming the world's wealthiest man. Yesterday's rally in Oracle stock pushed Ellison's net worth up about $100 billion,

Starting point is 00:12:39 maybe the biggest single-day gain in wealth in history. With a net worth of nearly $400 billion now, he's basically neck-and-neck with Elon Musk, and in fact is likely back to second place by the time you're listening, but that's a little bit besides the point. The reason it matters at all is that the title of the world's richest man has tended to follow the dominant tech category of the time. Elon Musk first took the title in 2021 when green tech and EVs were the big narrative. Before that, it was Jeff Bezos on the back of e-commerce.

Starting point is 00:13:04 Prior to Bezos, Bill Gates was the wealth leader from software. Now the world's richest man is an AI infrastructure baron. Now, of course, there is still plenty of skepticism out there. Many of the reports have pointedly noted that open-eastern, AI's current annual revenue was $10 billion in June. The company, therefore, has a long way to go before they can afford to cash flow $300 billion in infrastructure spending. Stock analyst Thomas Chua posted, the entire growth narrative hinges on a cash-burning customer's ability to raise unprecedented amounts of capital. If OpenAI's funding dries up, Oracle could be left with

Starting point is 00:13:34 massive stranded investments in AI infrastructure and a collapsed growth story. There are also plenty of people pointing out that AI revenue is looking a little circular, with OpenAI's revenue flowing into Oracle's balance sheet, which then flows into Nvidia's balance sheet and back in to GPU financing deals. Shortseller Jim Chanos wrote an entire thread poking holes in the open AI deal, and yet he didn't declare that he's short Oracle. And that's pretty much where we're at with AI bears on Wall Street. It is very cheap right now to write a research note about the AI bubble, but it is starting to get extremely expensive to be stuck on the sideline as AI names rip. The Kobayisi letter simply pointed out the gigantic green candle writing, as we continue

Starting point is 00:14:10 to reiterate, we are still so early in the AI revolution. Take him, the author of the NVIDIA Way posted, ridiculous how outlets amplified and platformed that shoddly written MIT paper, conflated Altman's bubble quote, and are now barely mentioning the biggest AI infrastructure print since NVIDIA's May 2023. Again, ignore the noise and focus on fundamental reality. Meanwhile, Wall Street analysts are simply revising their targets up as fast as possible. Strategists for Wells Fargo, Barclays, and Deutsche Bank all boosted their targets for the S&P 500 this week, citing resilient earnings and an unrelenting AI investment cycle. Deutsche Bank now sees another 7% move coming by the end of the year, added to an 11% gain year-to-date.

Starting point is 00:14:49 And importantly, Oracle is the biggest move, but it's by no means the only one. Broadcom is also up more than 20% over the past month on the back of a $10 billion deal to manufacture chips for OpenAI. Tiny cloud provider Nebius is up more than 40% this week after signing a $17 billion partnership with Microsoft. Increasingly, the AI rally is no longer just about the mega-cap tech stocks pushing higher. there is an entire AI supply chain getting dramatically repriced as deals are put in place to power the next five years of development. And what's more, as we discuss AI jobs disruption, we're basically completely ignoring all that's going to go into servicing these big deals. Usually when you have a moment of creative destruction, the destruction comes before the creative. It's why new technology often faces resistance.

Starting point is 00:15:34 It's easier to see what it disrupts before we understand fully what it creates. In this case, because of infrastructure, we have the potential to see tens of thousands of new jobs materialize, entire categories of new skills that become extremely important and high leverage, very early on in the technology's lifecycle because of the need for infrastructure. You don't hear much about that because there's still such an impulse towards skepticism and not wanting to count chickens before they're hatched. But at least when it comes to the market, it is very clear that for the moment now, AI skepticism is firmly off the menu.

Starting point is 00:16:06 And I think that if you look around even beyond the markets, there's reasons to think that this shift is more broad-based as well. First of all, let's talk about Replit's new Agent 3. One of the things that was really interesting about the discourse around GPT5 was the extent to which it associated AI progress with simply the performance of the base model. Meanwhile, companies are creating entire systems and agents that build on top of models and take advantage of them that are getting more and more performance, even if it is true that base model performance is plateauing, which itself is a highly questionable proposition. But the

Starting point is 00:16:40 point is that even if it were plateauing, there are so many other places where we're seeing gains in actual practical production and performance. Agent 3 from Replit is a major step up in what autonomous coding agents can do. In fact, they're claiming that it's 10x more autonomous than their previous agents. The agent is capable of running tests and fixing bugs as it goes. It can also build other agents and automations, and can be used to build workflows within other apps. The biggest improvement is a 200-minute maximum runtime. Why Combinators Paul Graham wrote, it seems a bit counterintuitive at first, but one of the most important tests of an AI is how long it can continue thinking about something productively. Replet is now up to 200 minutes. CEO Amjad Masad called

Starting point is 00:17:23 this the full self-driving moment of software. One of the big benchmarks for agent autonomy over the past year has been meter's study of the time horizon for agentic tasks. The study measured agent performance against tasks that would take a human developer various lengths of time, and graded them at 50% and 80% accuracy. The study found that the agent time horizon has been doubling, on average every seven months, and seems to be increasing. Back in July, SWIX wrote, Ambien agents are going to completely dominate the rest of 2025. Human deep work and focus requires at least one to two hours uninterrupted. By end of year, all next-gen models will pass the one-to-two-hour autonomy meter barrier, meaning they will be used in completely different ways than the current one to 15-minute autonomy frontier.

Starting point is 00:18:05 Now, Massad brought up this study in the context of the release of Agent 3, writing, the meter paper that says that the length of tasks AI can do is doubling every seven months, radically undersells the scaling that we're seeing at Replit. It might be true if you're measuring one long trajectory for a single model class, but this is where an Agent Research Labs Alpha is. We build multi-agent architecture and use different models from various providers to tap into their latent abilities across various tasks. Another way of saying what he's saying is that while the meter paper might be a good benchmark for

Starting point is 00:18:33 general progress of the underlying models, it doesn't do a very good job of measuring the state of the art as a gentic scaffolding gets more complex. With this release, we're starting to move beyond coding agents that produce one-time apps and towards continuous AI workers. Gerard Lipscomb, a developer working with the Replit Stack, posted, one-shining apps was never the test of a vibe coding platform. The real benchmark is maintaining and improving products over time, while serving real users without breaking on every new change. Replit is moving faster in this direction than anyone else I'm aware of. As someone building my business on Replit, Autonomous App Testing and Automations is music

Starting point is 00:19:07 to my ears. It's yet another signal that they're building a special kind of full-stack ecosystem. The kind that one day might allow you to build not just a product, but a business that runs and improves itself. This is, I think, exactly what all of the discourse around how much better a base model is than the number that came before it misses. It misses the fact that the improvements are coming from these systems. which are being put together to accomplish ever bigger and more significant groups of work all at once.

Starting point is 00:19:33 Now, the last thing that I want to point to as another indication that we are at the beginning of a narrative shift once again away from skepticism and towards excitement again, is that Thinking Machines Lab came out with its first shared research and people are really excited. On Wednesday, the company put out a paper on the issue of nondeterminism or randomness in AI outputs. Basically, the idea that you could prompt the same AI two separate times with the exact same prompt and the exact same model and come up with two very different answers. This has been treated as just sort of the cost of doing business with LLMs and something that couldn't necessarily be fixed. However, the team at Thinking Machines Lab, led by Horace He, who

Starting point is 00:20:08 was recruited from Meta, argued that actually the problem here is something which they call batch invariants. The neuron actually had a really interesting layperson, ELI-5 kind of analogy. They said, imagine ordering the same coffee at Starbucks, but it tastes different depending on how many other customers are in line. That's essentially what's happening with AI models. When an AI server is busy handling lots of requests, it processes them in batches. Your request gets bundled with others because that's more efficient, and somehow this changes your specific answer even though it shouldn't. Follow the logic and the busier the server, the more your results vary. Thinking Machines Labs, or Thinky for short, release their approach as

Starting point is 00:20:42 open source code, which they're calling batch invariant kernels. Now, hold aside all the specifics here. This is a big, thorny, technical research problem that a lot of people had just given up on, And so the fact that this lab dropped a solution in a way that anyone else can interact with it has, I think, jogged people into remembering that there's actually still so much exciting science and research to be done that will fundamentally improve what we get out of AI. So summing this up, yesterday we got one of the biggest days in stock history because of a massive AI infrastructure deal and a halo effect that came over into other AI stocks as well. We got a brand new agent that totally pushed the frontiers of how much autonomy an agent could actually behave with. and we got some really exciting new science. That is why I am saying officially AI skepticism is canceled. RIP, hope you enjoyed your month in the spotlight.

Starting point is 00:21:33 For now, that's going to do it for the AI Daily Brief. Appreciate you listening or watching as always. And until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - AI Skepticism is CANCELLED

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.