Tech Brew Ride Home - Fri. 11/22 – An LLM Siri But… Only The Year AFTER Next?

Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to the Tech meme right home for Friday, November 22nd, 2024. I'm Brian McCullough today. Wait. How long is it going to take Apple to make Siri behave like ChatGPT already does today?

Starting point is 00:00:45 Maybe that talk of OpenAI buying the Chrome browser isn't completely as far-fetched as I thought. Is Threads feeling the heat from blue sky? Do I want to wear a watch on my finger? And of course, the weekend long-read suggestions. Here's what you miss today in the world of tech. From the, well, of course they are. why wouldn't they be? It would be almost criminally negligent if they weren't file. Mark German says, Apple is testing a more conversational version of Siri, dubbed LLM Siri,

Starting point is 00:01:17 with plans to release it in the spring of 2026 as part of iOS 18 and MacOS 16. Quoting Bloomberg, the new Siri, details of which haven't been reported, uses more advanced large language models or LLMs to allow for back-and-forth conversations, said the people who asked not to be identified because the effort hasn't been announced. The system also can handle more sophisticated requests in a quicker fashion, they said. The new voice assistant, which will eventually be added to Apple Intelligence, is dubbed LLM Siri by those working on it. LLMs, a building block of generative AI gorge on massive amounts of data in order to identify patterns and answer questions. Apple has been testing the upgraded software on iPhones, iPads, and Macs as a separate app,

Starting point is 00:02:01 but the technology will ultimately replace the Siri interface that users rely on today. The company is planning to announce the overhaul as soon as 2025 as part of the upcoming iOS 19 and MacOS 16 software updates, which are internally named Luck and Cheer, the people said. The revamped Siri will rely on new Apple AI models to interact more like a human and handle tasks in a way that's closer to chat GPT and Google's Gemini. It also will make expanded use of app intents, which allow for more precise, control of third-party apps, and the software will be able to tap into features from Apple Intelligence, such as the ability to write and summarize text. Like Apple Intelligence this fall, the new features won't immediately be included in next year's crop of hardware devices. Instead,

Starting point is 00:02:46 Apple is currently planning to release the new Siri to consumers as early as spring 2026, about a year and a half from now. Given that Apple is still several months away from announcing the plan, the timing and features could still shift, end quote. early as spring 2026? This might be the most powerful signal we've seen yet. How far behind Apple is

Starting point is 00:03:11 really an AI? They need 18 months just to add LLMs to Siri. Like, how much do you think the chatbot scene will have changed by the time they get feature parody to what smaller players already have in the market today? Quoting Joe Rosensteele on

Starting point is 00:03:26 Mastodon, there's bound to be a really interesting story of corporate intrigue about how the people in charge of Siri have been actively opposed to virtual assistance for a decade and have been waging a silent campaign of sabotage behind the scenes. Either that or a really boring story about unimaginative managers hitting minor progress targets for their bonuses, end quote. About that Google selling Chrome to Open AI possibility, I don't know that this materially changes the odds of it actually happening, but for what it's worth, the information is reporting that OpenAI has considered making a browser in the past,

Starting point is 00:04:09 even discussed deals to power AI features on Samsung devices, and search on sites and apps from Kandai Nass and others. Quote, if OpenAI launches some or all of these products, it would become an even bigger competitor to Google, which dominates the browser market with Chrome and the search market with Google Search, and powers Samsung's phones with its Android software. More recently, Google's Gemini AI began powering. features on Samsung devices, such as providing a text summary or a voice recording or using image-generating technologies to edit photos. The state of talks between Samsung and OpenAI

Starting point is 00:04:44 couldn't be learned, but Google has been preparing for the possibility of competing with OpenAI to power such features, set a person with knowledge of the situation. For Samsung, it makes sense to have more than one potential provider of such technology as it negotiates the terms of deals. ChatGPT generates billions of dollars a year in revenue from subscriptions and has added search-like features that show real-time information from the web, in part with help from Microsoft's search technology, but ChatGPT hasn't visibly hurt Google search yet, even if some people are using it as a partial replacement for Google. Still, ChatGPT is growing quickly and currently dominates the nascent market for AI chatbots. Making a web browser could help OpenAI have more control

Starting point is 00:05:25 over a primary gateway through which people use the web, as well as further boosting ChatCHPT, which has more than 300 million weekly users just two years after its launch. It isn't clear how a chat GPT browser's features would differ from those of other browsers. In a signal of its interest in a browser, several months ago, OpenAI hired Ben Gujar, a founding member of the Chrome team at Google. Another recent hire is Darren Fisher, who worked with Googer to develop Chrome, but OpenAI isn't remotely close to launching a browser, multiple people said. Launching a browser is timely and complicated.

Starting point is 00:06:01 because browser providers need to ensure people's data doesn't leak to websites, and the browser needs to work with various types of extensions to adequately compete with incumbent browsers, among other things, end quote. I continue to say that the sudden explosion over on blue sky is not necessarily or so much people fleeing X, as it really might be people finally giving up on threads. Blue Sky is closer to old Twitter than basically anything else is at the moment. And that's all people ever wanted, I argue. Because meta made threads in its inimitable meta way, it does not function well as a breaking news platform or a digital town square. Like when the Matt Gates news

Starting point is 00:06:50 broke the other day within seconds, it was immediately all over X, all over blue sky, but over at threads, as M.G. Siegler joked, you were still being served up posts like, oh, what a lovely new lens for my like a camera. Well, maybe Threads is feeling the heat because Adam Osseri says, oh yeah, maybe we will tweak some of the things to give you what you've been asking me for four months now. Quoting the Verge, we are rebalancing ranking to prioritize content from people you follow, which will mean less recommended content from accounts you don't follow and more posts from the accounts you do starting today, Threads boss Adam Osseri announced on Thursday. This is another significant change to Threads, since Minutes.

Starting point is 00:07:32 people started flocking to blue sky. It lets people default to seeing their following feed when they open the app and offers a lot of customizability, including custom feeds. Threads just yesterday rolled out its take on custom feeds less than a week after it started testing them. The change could make the For-U feed include more accounts you care about, but we'll have to wait and see if the updates address the problem of the feed surfacing posts that are very old and no longer timely. You still can't leave threads in the following feed, though. It also means that creators should expect unconnected reach to go down, but connected reach to go up, Moseri says, end quote.

Starting point is 00:08:11 On those quickly rolled out custom feeds, features, quoting TechCrunch, hoping to capitalize on user demand for more personalization, custom feeds are meant to allow Threads users to easily build feeds around specific topics or those including certain user profiles. This would make it easier for Threads users to tap into the communities and conversations that are most important to them and could help to challenge. Blue Sky's own set of tools for personalization, including those that let users build their own algorithms, feeds, and lists, as well as those that let them configure their own moderation tools. The global launch of custom feeds on threads comes only days after Meta CEO Mark Zuckerberg

Starting point is 00:08:47 announced the feature was entering testing. That signals that threads is moving quickly to topple some of the momentum that Blue Sky has recently seen. Since the U.S. elections, Blue Sky adoption has soared as users looked for an alternative to the now more right-leaning X owned by Elon Musk. Other decisions at X have also pushed people to depart, including how it has changed the block feature and its policy around training AI models on user data. Some users on threads have been disappointed by its decision to deprioritize politics and have asked for other options beyond the default algorithmic 4U feed and the chronological following feed. That's where custom feeds come in. To use the feature, you'll first need to search for and then tap into a topic to see the latest posts. From there, you'll tap on the three dot icon next to the search term and choose the option

Starting point is 00:09:34 create new feed. You can also choose to add specific user profiles to a feed by visiting the user's profile, tapping the three dot icon above their profile picture, and then tapping to add them to one of your feeds. After the feeds are created, they can be pinned to the top of the threads home screen on the web in the columns-based view similar to Twitter's tweet deck or X-Pro, making them easier to access, end quote. Interesting gadget here. How about a smart ring like an or a smart ring, but it's actually a watch?

Starting point is 00:10:11 Specifically a tiny little old and timely Cassio watch. This is not from the onion, quoting the verge. Not to be outdone by its longtime rival, Cassio has announced its own digital ring watch that brings more functionality than Timex's wearable that debuted last month, created to help commemorate the 50th anniversary of Cassio getting into the digital watch. business, the CRW001JR will be available in Japan starting in December for around $128. Although the tiny watch's case is just shy of being an inch in size, Cassio has managed to squeeze in a retro six-segment LCD screen that can display hours, minutes, and seconds. The ring also includes three functional buttons that can control additional features like displaying the date or the time in a different time zone and a stopwatch.

Starting point is 00:11:00 The ringwatch's screen even has a light and an alarm function that will flash the display instead of playing an audible sound. It's powered by a single battery that Cassio says will keep the waterproof watch running for about two years, but is also easily replaceable when it does die. To accurately recreate the design and intricate details of the larger digital watches the ring is based on, Cassio says the wearable is manufactured as a single piece using a metal injection molding process That starts with powdered metal. As a result, unlike Timex's watch ring, that features a stretchable band to accommodate different users, the CRW001JR is permanently a U.S. size 10.5. If your fingers are smaller than that, Cassio includes a couple of spacers to improve the fit. If your fingers are larger than that, you'll need to find some other way to check the time, end quote. As ever, this is probably one of those segments that it's worth clicking through on to see the pictures of. And then tell me I don't need this. Tell me I don't need this.

Starting point is 00:12:02 Tell me I don't need this. Time for the weekend long read suggestions. First up from the verge. Just a pretty comprehensive guide to blue sky and what exactly the features are that people are flocking over there to enjoy. Like the custom algorithmic feeds and how to set them up, the starter packs and how to make use of them. The ability to do custom domains and change your handle and labelers, a thing that I did not know exist. then a chaser to the whole idea of Apple being behind in AI. From Fortune, a look at how Mark Zuckerberg made Lama a cornerstone of the AI ambitions at Meta, whose smartphone-era services and products

Starting point is 00:12:48 have always been constrained by Apple and Google, as we know. Quote, while Chad GPT remains the dominant gen AI tool in the popular imagination, Lama models now power many, if not most, of the meta products that billions of consumers encounter every day. Meta's AI assistant, which reaches across Facebook, Instagram, WhatsApp, and Messenger is built with Lama, while users can create their own AI chatbot with AI studio. Text generation tools for advertisers are built on Lama. Lama helps power the conversational assistant that is part of META's hit Rayban glasses and the feature in the Quest headset that lets users ask questions about their surroundings. The company is said to be developing its own AI-powered search engine and outside its wall.

Starting point is 00:13:30 Lama models have been downloaded over 600 million times on sites like open source AI community hugging face. Still, the pivot has perplexed many meta-watchers. The company has spent billions to build the Lama models. On its third quarter earnings column, it announced that it projects capital expenditures for 2024 to reach as high as $40 billion, with a, quote, significant increase, likely in 2025. Meanwhile, it's giving Lama away for free to thousands of companies, including giants like Goldman Sachs, AT&T, and Accenture. Some investors are struggling to understand where and when exactly meta's revenue would start to justify the eye-watering spend. Nonetheless, Lama's contrarian success has allowed Zuckerberg to shrug off the lukewarm response to his metaverse ambitions and the company's painful year of efficiency in late 2022 and early 2023.

Starting point is 00:14:18 The rise of Lama has also given Zuckerberg a chance to address a long, simmering, sore point in his otherwise meteoric career. The fact that Facebook and now META have so often seen their services and products constrained, by rules imposed by Apple and Google, the rival giants whose app stores are Meta's primary points of distribution in the mobile device era. As he wrote in a July blog post, we must ensure that we always have access to the best technology and that we're not locked into a competitor's closed ecosystem where they can restrict what we build. With Lama, Mena and Zuckerberg have the chance to set a new industry standard. I think we're going to look back at Lama 3.1 as an inflection point in the industry where open source AI started to become the industry

Starting point is 00:15:00 just like Linux is, he said on Meta's July earnings call, invoking the open source project that disrupted the dominance of proprietary operating systems like Microsoft Windows, end quote. And from the Financial Times, a long read, and I mean a long one, but a very, very interesting, detailed look at how AI, as we've discussed before, might be the technology that unlocks robotics at scale, quote, the extraordinary progress in generating text and images using AI over the past two years have been accomplished by the invention of large language models, the systems underpinning chatbots. Roboticists are now building on these and their cousins visually conditioned language models, sometimes called vision language models, which connect

Starting point is 00:15:44 textual information and imagery. With access to huge existing troves of text and image data, researchers can pre-train their robot models on the nuances of the physical world and how humans describe it, even before they begin to teach their machine students' specific actions. But in the chaotic, ever-changing real world, machines not only need to be able to execute individual tasks such as these, they also need to carry out a multitude of jobs in different settings. Those at the heart of robotics believe the answer to this generalization is to be found in foundation models for the physical world, which will draw on growing databases relating to movement banks of information recording robot actions. The hope is these large behavioral models, once big enough, will help machines adapt to new,

Starting point is 00:16:29 and unpredictable environments, such as commercial and domestic settings, rapidly transforming business and our home lives. But these models face numerous challenges beyond what is required for linguistic generative AI. They have to drive actions that obey the laws of physics in a three-dimensional world and adapt to dynamic environments occupied by other living things. The current challenge for developing these large behavioral models is the scarcity of data, a difficulty also facing large language models as a result of human information sources being exhausted, but a great communal effort is underway among the robotics community to generate new sets of training data. We've been seeing more data be available, including data for very dexterous tasks, and seeing a lot

Starting point is 00:17:09 of the fruits of putting that data in, says Stanford's Finn. It suggests that if we are able to scale things up further, then we might be able to make significant breakthroughs in allowing robots to be successful in real-world environments. Click through on this one, because tons of graphics are in there to explain everything. The weekend bonus episode this weekend is me on another podcast, the Daily Detroit podcast. I'm sharing this one because the topic we discuss is something that I've not really been able to get into on this show, the how and why I think it's so hard to seed new tech

Starting point is 00:17:54 hubs in places beyond Silicon Valley. And crucially, what I think regions should do to actually make that happen. Listen, enjoy, talk to you on Monday.

Tech Brew Ride Home - Fri. 11/22 – An LLM Siri But… Only The Year AFTER Next?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.