Tech Brew Ride Home - Thu. 04/17 – Mooooaaar Models!

Episode Date: April 17, 2025

OpenAI has a new reasoning model, and more lower cost models as well. Is OpenAI about to acquire a coding startup? Is Perplexity turning to Samsung for distribution and branding? A Neuralink rival get...s FDA approval. And why is Jensen Huang on a code-red mission to China at the moment? Sponsors: 1Password.com/ride Links: OpenAI's new o3 and o4-mini models are all about 'thinking with images' (Engadget) OpenAI debuts Codex CLI, an open source coding tool for terminals (TechCrunch) Vibe Check: o3 Is Here—And It’s Great (Every) OpenAI In Talks to Buy Windsurf for About $3 Billion (Bloomberg) Perplexity AI in Talks to Integrate Assistant Into Samsung, Motorola Phones (Bloomberg) Neuralink Rival Gets FDA Clearance in Path to Less Invasive Brain Implants (Bloomberg) Nvidia chief Jensen Huang flies to Beijing for talks (FT) Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to the Tech meme right home for Thursday, April 17th, 2025. I'm Brian McCullough today. OpenAI has a new reasoning model and another lower cost model as well. Is OpenAI about to acquire a coding startup? Is perplexity turning to Samsung for distribution and branding? A neuralink rival gets FDA approval and why is Jetson Huang on a code red mission to China at the moment? Here's what you miss today in the world of tech. More models, everybody. OpenAI has launched Outs. O3, its most advanced AI reasoning model yet, as well as O4 Mini, a lower cost alternative that
Starting point is 00:01:15 still delivers, according to them, impressive results for chat GPT paid users. Quoting and gadget, a mere two days after announcing GPT4-1, OpenAI is releasing not one but two new models. The company today announced the public availability of O3 and O4 Mini. Of the former, OpenAI says O3 is its most advanced reasoning model yet, with it showing strong performance in coding math and science tasks. As for O3Mini, OpenAI is billing it as a lower cost alternative that still delivers impressive results across those same fields. More notably, both models offer novel capabilities not found in OpenAI's past systems. For the first time, the company's reasoning models can use and combine all of the tools available in chat GPT,
Starting point is 00:02:01 including web browsing and image generation. The company says this capability allows O3 and 04 Mini to solve challenging multi-step problems more effectively and, quote, take real steps toward acting independently. At the same time, O3 and O4 Mini can not just see images, but also interpret and think about them in a way that significantly extends their visual processing capabilities. For instance, you can upload images of whiteboards, diagrams, or sketches, even poor quality ones, and the new models will understand them. They can also adjust the images as part of how they reason. The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness, says OpenAI. Separately, OpenAI is releasing a new coding agent a la Claudecode named Codex CLI. It's designed to give developers a minimal interface they can use to link OpenAI's models with their local code.
Starting point is 00:03:02 out of the box it works with O3 and O4 Mini with support for GPT41 on the way, end quote. Yes, on that coding agent, quoting tech crunch, codex CLI appears to be a small step in the direction of OpenAI's broader agentic coding vision. Recently, the company's CFO, Sarah Fryer, described what she called the agentic software engineer, a set of tools OpenAI intends to build that can take a project description for an app and effectively create it, and even quality assurance test it. Codex CLI won't go that far, but it will integrate OpenAI's models, eventually including 03 and 04Many, with the clients that process code and computer commands, otherwise known as command line interfaces. It's also open source, OpenAI says. And a blog post provided to TechCrunch OpenAI added, you can get the benefits of multimodal reasoning from the
Starting point is 00:03:53 command line by passing screenshots or low fidelity sketches to the model combined with access to your code locally via Codex CLI. to spur use of Codex CLI, opening I plans to dole out $1 million in API grants to eligible software development projects. The company says it'll award $25,000 blocks of API credits to the project's chosen, end quote. But let's get back to 03 and the fact that it is a reasoning model. Dan Shipper at Every wrote up a summary of his experiences with it, and while I encourage you to read his entire thing to get a sense of the different ways he used it, I'm just going to read from his conclusion here, quote, this is the biggest one.
Starting point is 00:04:30 wow moment I've had with a new open AI model since GPT4. The company has successfully lengthened the leash that an AI gets in order to do its tasks. Now you can reliably let it work for minutes at a time to get higher quality answers with no intervention. But what makes O3 especially powerful is the integration between the model and chat GPT. It has tools like web search, reminders, memories, and code interpreter that allow users to access more of its power with less hassle. This is a very valuable strategic position for OpenAI to be in. one that will further its lead as the destination chapout of choice for this AI era. O3 is also a lesson in the seasons of change that companies go through.
Starting point is 00:05:10 For about a year after Sam Altman's firing, Open AI was slow to release new products, and analysts were prognosticating about its downfall. With this release stacked on its deep research and GPD 4.5 recently, OpenAI is back on a tear, end quote. Sticking with Open AI for a second, Bloomberg says, Open AI is in talks to acquire WindSurf, an AI coding tool formerly known as codium for around $3 billion. WindSurf was valued at $1.25 billion in a 2024 funding round. So we know that coding AI is the new hotness, and Open AI has been moving to plant their flag in this arena.
Starting point is 00:05:56 But what does this startup have that Open AI doesn't think they can build on their own? Quoting from Bloomberg. The deal would be OpenAI's largest acquisition to date and could help the company take on rising competition in the market for AI-driven coding assistance, systems capable of tasks like writing code based on natural language prompting. WindSurf, formerly called ExaFunction, had recently been in talks with investors, including Kleiner Perkins and General Catalysts to raise funding at a $3 billion valuation. The company was valued at $1.25 billion in a deal led by General Catalyst last year.
Starting point is 00:06:30 If the deal closes, Open AI would compete more directly with companies like Anthropos, Microsoft-owned GitHub and AnySphere, which offers services in the fast-growing field of AI coding. The deal would also be a signal that, as more companies pile into the sector, the number of mergers and acquisitions will rise. OpenAI has acquired other startups before, including Vector Database Company Rockset and Multi, a remote collaboration platform. Investors have recently been enthusiastically backing AI coding tools, including AnySphere, the startup behind Cursor, which was in talks with investors to raise money at a
Starting point is 00:07:03 evaluation of almost $10 billion earlier this year, end quote. Three interesting tweets on this. First, quoting Austin Al-Read. If OpenAI buys WinServe, does it get data as to all of the input prompts and then outputs for all of the different models? I think it might. And you'd be able to see whether the output is accepted or rejected, end quote. Then Zach Weinberg.
Starting point is 00:07:26 I'm sure everyone deep in the industry already knew this, but very clear the core model companies realize they need to own the customer slash application layer to actually make money. Tons of M&A likely coming to fight for the most mature application layer companies in the biggest categories, coding, marketing, maybe customer service will be very tough to compete with the OpenAI or Google or Microsoft application bundle in the long run, end quote. Then at semi-analysis, cursor and Anthropic had a mutually beneficial relationship, but labs realize controlling the main application of a model is as valuable as owning the model itself. With this acquisition, Open AI gains greater ecosystem control and can build better products, end quote.
Starting point is 00:08:15 The race is real, folks. Bloomberg sources say perplexity is in discussions to integrate its AI assistant on Samsung's devices and has reached an agreement to preload perplexity on Motorola's phones. Quote, the details of the Samsung deal are still being hashed out, the people said, the terms could involve offering perplexity as a default AI assistant option or preloading the startup's Android app on phones. Samsung, the world smartphone market leader, also could promote the assistant heavily as an option within the Galaxy store where users download apps. A deal with perplexity is complicated by Samsung's wide-ranging partnership with Alphabet, which powers many of the AI features on Samsung devices and provides the default search engine.
Starting point is 00:08:59 But Samsung has been fostering a relationship with perplexity for months. South Korea-based company's investment arm called Next, backed the startup last year. Samsung is discussing making another investment in perplexity in the near future, the people said. The startup has been in broader talks with investors for a new round of funding, which would double its valuation to $18 billion. It is discussed raising between $500 million and $1 billion. Bloomberg News reported in March. Founded in 2022, Perplexity has gained popularity for real-time results and search engine-like user interfaces. The assistant has the ability to service in-depth research, and perplexity can tap into models beyond its own, including those
Starting point is 00:09:36 from OpenAI and Anthropic PBC. The Motorola deal will include perplexity being preloaded as an alternative assistant to Google's Gemini. There will also be a tailored user interface for Motorola's updated razor-foldable phones, and marketing from the device maker will encourage customers to try perplexity. Today, Motorola offers its own homegrown AI system and access to Google Gemini. Samsung takes a similar approach, providing its Bixby service alongside Google's AI platform. For San Francisco-based perplexity, securing Motorola and Samsung deals would be a major boon to building its brand and gaining popularity in the U.S. where ChatGPT and Gemini are more established. While Motorola's market share is slim in the smartphone industry, the company is making
Starting point is 00:10:17 strides. Samsung, for its part, has a share of about 20% globally, according to data from IDC, end quote. The U.S. Food and Drug Administration has cleared Neurrelink rival precision neuroscience to offer a less invasive brain implant. Quoting Bloomberg. One of the co-founders of Elon Musk's Neurolake Corp is building a different kind of brain implant. That device made by the startup precision neuroscience corp went in 63-year-old Tim Fisher earlier this month. In an operating room in Philadelphia, a surgeon placed a sliver of film thinner than a human hair and embedded with over a thousand electrodes on Fisher's brain. After a little training when Fisher moved his left hand, so did a nearby robotic appendage, guided by signals from his brain and the new brain patch.
Starting point is 00:11:12 It was an amazing experience, said Fisher, a retail worker who lives in Westchester, Pennsylvania, and has Parkinson's disease. He said he's drawn to these kinds of implants because his tremors make everyday tasks like typing or opening a jar near impossible. The brain device that Fisher used was cleared by the U.S. Food and Drug Administration late last month for limited uses, bringing precision one step closer to a commercially available technology that one day might allow people to control computers or other devices with just their thoughts. That could help people with paralysis, electronically communicate with others, use computer software, and get a job. The device is already being used to help doctors map brain activity in more detail, to plan
Starting point is 00:11:50 procedures, and could also enable researchers to study the brain more closely and accurately. Precision was started in 2021 by Michael Magar and Benjamin Rapaport, a brain surgeon who previously helped co-found Neurrelink, in contrast with some other experimental systems like Neurilinks, which stick into the brain tissue, precision's strip is designed to sit on top, Rappaport said. The idea is to build something that can be inserted and removed without leaving a trace on the brain. Fisher's Patch, for instance, was only in place temporarily during the procedure. Around the size of a U.S. postage stamp, the precision device contains 1,024 tiny metal electrodes that are each a fraction of a millimeter in diameter.
Starting point is 00:12:28 It's been studied in at least 37 patients across five medical centers and about one-third of U.S. medical centers that do major brain surgeries have expressed interest in it, according to the company. We've been amazed by the demand for it, Rappaport said. The strip was greenlit by the FDA to record, monitor, and stimulate brain activity and can be used for less than 30 days. The regulator cleared the device based on a submission that included testing, but not the human studies. At least one other company, BlackRock NeuroTech, has received a similar clearance for its Neuroport Array device, end quote. I know, I know, I promise not to do this, but tech stocks were down, significantly yesterday. What's worrying people now? Well, that's what I want to talk about. It's
Starting point is 00:13:14 about that wrinkle around chips and tariffs and export controls, but also specifically about NVIDIA. All the big chip players, AMD, NVIDIA, and ASML were down 7% yesterday. And NVIDIA CEO Jensen Huang apparently arrived in China yesterday and visited Beijing today, a source says, to meet officials to discuss Trump's new chip export rules. quoting the FTE. According to two people familiar with his travel schedule, Huang met NVIDIA clients, including the founder of Generative Artificial Intelligence Startup Deepseek to discuss new chip designs for Chinese customers. He then held separate talks with Chinese premier He Lai Fang, according to one person familiar with the meeting. Huang said China
Starting point is 00:13:57 was a very important market for NVIDIA and expressed hope that his company could, quote, continue cooperating with the country, according to state broadcaster CCTV. On Tuesday, NVIDIA said it expected a $5.5 billion hit to earnings from new U.S. export restrictions on its H-20 chip, a lower-powered model that had already been designed to comply with Joe Biden-era controls limiting exports to China. Wong's talks indicate that NVIDIA is not willing to give up on the Chinese market and is considering designing yet another chip for it, even though its previous efforts have been banned by Washington. Plans for the NVIDIA chief's visit to Beijing were finalized after U.S. President Donald Trump's unexpected move to ban the H-20 chip. Invidia reported $17 billion in sales from China last year,
Starting point is 00:14:42 but faced growing threats to its business from Beijing, even before Trump interceded, end quote. In a separate Financial Times piece, they report that NVIDIA is hustling now because they were actually caught flat-footed. After a meeting with Trump at his Florida residents at Mara Lago earlier this month, Nvidia executives were left with the impression they could escape tougher enforcement of any curbs, the people said, adding the company's plan to invest $500 billion in the U.S. had also impressed the president. This led NVIDIA to tell Chinese clients, including tech giants, Alibaba, bite dance, and Tensit, that orders of H20s would not be affected, the people said. NVIDIA was then blindsided as Trump decided to clamp down on the export of H20, a product
Starting point is 00:15:26 that Chinese tech groups have relied upon in their efforts to challenge their global peers to develop large language models. Frustrated Chinese tech companies have complained about not having enough warning about such a major policy change, but our understanding that the shift is beyond the control of Nvidia, according to the people with knowledge of recent discussions. AI demand jumped in China after DeepSeek's successful launch of its low-cost reasoning model led local companies to put almost $17 billion in orders for H20 chips this year, according to one of the people.
Starting point is 00:15:54 While Nvidia typically takes more than six months to deliver such chips, most of this year's orders from its Chinese clients are yet to be filled and will probably be affected by the latest U.S. restrictions. The $5.5 billion hit-to-earnings NVIDIA announced are mostly the cost of materials to be used to produce such orders and related penalties and operational costs for not delivering based on agreed terms. The actual affected revenue from China could be more than $10 billion, the person estimated. China's tech giants are racing to find a replacement to the H-20, while Trump's new export controls could significantly help the sales of domestic manufacturers, led by Huawei, which has been pushing to produce more AI processors. It also remains unclear how Chinese groups can apply for a license to obtain H-20s and on what basis they would be issued. Intel told its Chinese clients last week that chips would require a license for exporting to China if they have a total D-RAM bandwidth of 1,400 gigabytes per second or more,
Starting point is 00:16:50 IO bandwidth of 1,100 gigabytes per second or more, according to a company email. Intel's Gaudy series as well as Nvidia's H20 far exceed these requirements, end quote. So yeah, sounds like Jensen is on a code red mission. As soon as I hit publish on this, we are going to take a tour of the Stanley Hotel, which is what the hotel and the Shining is based on. Apparently Stephen King came here in the 70s right when the Stanley was shutting down for the season. And boom, inspiration for the Shining. after he visited the hotel bar and met with the bartender whose name coincidentally was
Starting point is 00:17:43 Lloyd. Talk to you tomorrow.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.