Tech Brew Ride Home - Fri. 11/22 – An LLM Siri But… Only The Year AFTER Next?
Episode Date: November 22, 2024Wait, how long is it going to take Apple to make Siri behave like ChatGPT already does today? Maybe that talk of OpenAI buying the Chrome browser isn’t completely far fetched after all. Is Threads f...eeling the heat from Bluesky? Do I want to wear a watch on my finger? And, of course, the Weekend Longreads Suggestions. Links: Apple Readies More Conversational Siri in Bid to Catch Up in AI (Bloomberg) OpenAI Considers Taking on Google With Browser (The Information) Threads’ algorithm will focus more on the people you follow (The Verge) As Bluesky soars, Threads rolls out custom feeds globally (TechCrunch) Casio’s first smart ring has innovative features like a stopwatch and flashing alarm (The Verge) The Weekend Longreads Suggestions: Here’s some cool stuff you can do with Bluesky (The Verge) How Mark Zuckerberg has fully rebuilt Meta around Llama (Fortune) Are the robots finally coming? (FT) Learn more about your ad choices. Visit megaphone.fm/adchoices
Transcript
Discussion (0)
On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco.
Hey, who did this to you?
What happened next turned the story into a political firestorm.
Reports have identified the victim as Bob Lee, the founder of Cash App.
From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16.
Welcome to the Tech meme right home for Friday, November 22nd, 2024.
I'm Brian McCullough today. Wait.
How long is it going to take Apple to make Siri behave like ChatGPT already does today?
Maybe that talk of OpenAI buying the Chrome browser isn't completely as far-fetched as I thought.
Is Threads feeling the heat from blue sky?
Do I want to wear a watch on my finger?
And of course, the weekend long-read suggestions.
Here's what you miss today in the world of tech.
From the, well, of course they are.
why wouldn't they be? It would be almost criminally negligent if they weren't file.
Mark German says, Apple is testing a more conversational version of Siri, dubbed LLM Siri,
with plans to release it in the spring of 2026 as part of iOS 18 and MacOS 16.
Quoting Bloomberg, the new Siri, details of which haven't been reported, uses more advanced
large language models or LLMs to allow for back-and-forth conversations, said the people who
asked not to be identified because the effort hasn't been announced. The system also can handle more
sophisticated requests in a quicker fashion, they said. The new voice assistant, which will eventually be
added to Apple Intelligence, is dubbed LLM Siri by those working on it. LLMs, a building block
of generative AI gorge on massive amounts of data in order to identify patterns and answer questions.
Apple has been testing the upgraded software on iPhones, iPads, and Macs as a separate app,
but the technology will ultimately replace the Siri interface that users rely on today.
The company is planning to announce the overhaul as soon as 2025 as part of the upcoming iOS 19 and MacOS 16 software updates,
which are internally named Luck and Cheer, the people said.
The revamped Siri will rely on new Apple AI models to interact more like a human and handle tasks in a way that's closer to chat GPT and Google's Gemini.
It also will make expanded use of app intents, which allow for more precise,
control of third-party apps, and the software will be able to tap into features from Apple
Intelligence, such as the ability to write and summarize text. Like Apple Intelligence this fall,
the new features won't immediately be included in next year's crop of hardware devices. Instead,
Apple is currently planning to release the new Siri to consumers as early as spring 2026,
about a year and a half from now. Given that Apple is still several months away from announcing
the plan, the timing and features could still shift, end quote.
early as spring
2026?
This might be the most
powerful signal we've seen yet.
How far behind Apple is
really an AI? They need
18 months just to add
LLMs to Siri. Like, how
much do you think the chatbot scene will have
changed by the time they get feature parody to
what smaller players already have in the market
today?
Quoting Joe Rosensteele on
Mastodon, there's bound to
be a really interesting story of
corporate intrigue about how the people in charge of Siri have been actively opposed to virtual
assistance for a decade and have been waging a silent campaign of sabotage behind the scenes.
Either that or a really boring story about unimaginative managers hitting minor progress targets
for their bonuses, end quote. About that Google selling Chrome to Open AI possibility,
I don't know that this materially changes the odds of it actually happening, but for what it's worth,
the information is reporting that OpenAI has considered making a browser in the past,
even discussed deals to power AI features on Samsung devices, and search on sites and apps from
Kandai Nass and others.
Quote, if OpenAI launches some or all of these products, it would become an even
bigger competitor to Google, which dominates the browser market with Chrome and the
search market with Google Search, and powers Samsung's phones with its Android software.
More recently, Google's Gemini AI began powering.
features on Samsung devices, such as providing a text summary or a voice recording or using
image-generating technologies to edit photos. The state of talks between Samsung and OpenAI
couldn't be learned, but Google has been preparing for the possibility of competing with
OpenAI to power such features, set a person with knowledge of the situation. For Samsung,
it makes sense to have more than one potential provider of such technology as it negotiates
the terms of deals. ChatGPT generates billions of dollars a year in revenue from subscriptions
and has added search-like features that show real-time information from the web, in part with help
from Microsoft's search technology, but ChatGPT hasn't visibly hurt Google search yet, even if some
people are using it as a partial replacement for Google. Still, ChatGPT is growing quickly and currently
dominates the nascent market for AI chatbots. Making a web browser could help OpenAI have more control
over a primary gateway through which people use the web, as well as further boosting ChatCHPT,
which has more than 300 million weekly users just two years after its launch.
It isn't clear how a chat GPT browser's features would differ from those of other browsers.
In a signal of its interest in a browser, several months ago, OpenAI hired Ben Gujar,
a founding member of the Chrome team at Google.
Another recent hire is Darren Fisher, who worked with Googer to develop Chrome,
but OpenAI isn't remotely close to launching a browser, multiple people said.
Launching a browser is timely and complicated.
because browser providers need to ensure people's data doesn't leak to websites,
and the browser needs to work with various types of extensions to adequately compete with incumbent browsers,
among other things, end quote.
I continue to say that the sudden explosion over on blue sky is not necessarily or so much people fleeing X,
as it really might be people finally giving up on threads.
Blue Sky is closer to old Twitter than basically anything else is at the moment.
And that's all people ever wanted, I argue. Because meta made threads in its inimitable meta way,
it does not function well as a breaking news platform or a digital town square. Like when the Matt Gates news
broke the other day within seconds, it was immediately all over X, all over blue sky, but over at
threads, as M.G. Siegler joked, you were still being served up posts like, oh, what a lovely new
lens for my like a camera. Well, maybe Threads is feeling the heat because Adam Osseri says,
oh yeah, maybe we will tweak some of the things to give you what you've been asking me for
four months now. Quoting the Verge, we are rebalancing ranking to prioritize content from people
you follow, which will mean less recommended content from accounts you don't follow and more
posts from the accounts you do starting today, Threads boss Adam Osseri announced on Thursday.
This is another significant change to Threads, since Minutes.
people started flocking to blue sky. It lets people default to seeing their following feed when they
open the app and offers a lot of customizability, including custom feeds. Threads just yesterday rolled
out its take on custom feeds less than a week after it started testing them. The change could
make the For-U feed include more accounts you care about, but we'll have to wait and see
if the updates address the problem of the feed surfacing posts that are very old and no longer
timely. You still can't leave threads in the following feed, though.
It also means that creators should expect unconnected reach to go down, but connected reach to go up,
Moseri says, end quote.
On those quickly rolled out custom feeds, features, quoting TechCrunch, hoping to capitalize
on user demand for more personalization, custom feeds are meant to allow Threads users to easily
build feeds around specific topics or those including certain user profiles.
This would make it easier for Threads users to tap into the communities and conversations
that are most important to them and could help to challenge.
Blue Sky's own set of tools for personalization, including those that let users build their own algorithms,
feeds, and lists, as well as those that let them configure their own moderation tools.
The global launch of custom feeds on threads comes only days after Meta CEO Mark Zuckerberg
announced the feature was entering testing. That signals that threads is moving quickly to topple some
of the momentum that Blue Sky has recently seen. Since the U.S. elections, Blue Sky adoption has soared as
users looked for an alternative to the now more right-leaning X owned by Elon Musk.
Other decisions at X have also pushed people to depart, including how it has changed the block feature and its policy around training AI models on user data.
Some users on threads have been disappointed by its decision to deprioritize politics and have asked for other options beyond the default algorithmic 4U feed and the chronological following feed.
That's where custom feeds come in.
To use the feature, you'll first need to search for and then tap into a topic to see the latest posts.
From there, you'll tap on the three dot icon next to the search term and choose the option
create new feed.
You can also choose to add specific user profiles to a feed by visiting the user's profile,
tapping the three dot icon above their profile picture, and then tapping to add them to one
of your feeds.
After the feeds are created, they can be pinned to the top of the threads home screen on
the web in the columns-based view similar to Twitter's tweet deck or X-Pro, making them
easier to access, end quote.
Interesting gadget here. How about a smart ring like an or a smart ring, but it's actually a watch?
Specifically a tiny little old and timely Cassio watch. This is not from the onion, quoting the verge.
Not to be outdone by its longtime rival, Cassio has announced its own digital ring watch that brings more functionality than Timex's wearable that debuted last month,
created to help commemorate the 50th anniversary of Cassio getting into the digital watch.
business, the CRW001JR will be available in Japan starting in December for around $128.
Although the tiny watch's case is just shy of being an inch in size, Cassio has managed to squeeze in
a retro six-segment LCD screen that can display hours, minutes, and seconds.
The ring also includes three functional buttons that can control additional features like
displaying the date or the time in a different time zone and a stopwatch.
The ringwatch's screen even has a light and an alarm function that will flash the display instead of playing an audible sound.
It's powered by a single battery that Cassio says will keep the waterproof watch running for about two years,
but is also easily replaceable when it does die.
To accurately recreate the design and intricate details of the larger digital watches the ring is based on,
Cassio says the wearable is manufactured as a single piece using a metal injection molding process
That starts with powdered metal. As a result, unlike Timex's watch ring, that features a stretchable band to accommodate different users, the CRW001JR is permanently a U.S. size 10.5. If your fingers are smaller than that, Cassio includes a couple of spacers to improve the fit. If your fingers are larger than that, you'll need to find some other way to check the time, end quote. As ever, this is probably one of those segments that it's worth clicking through on to see the pictures of.
And then tell me I don't need this.
Tell me I don't need this.
Tell me I don't need this.
Time for the weekend long read suggestions.
First up from the verge.
Just a pretty comprehensive guide to blue sky and what exactly the features are that people are flocking over there to enjoy.
Like the custom algorithmic feeds and how to set them up, the starter packs and how to make use of them.
The ability to do custom domains and change your handle and labelers, a thing that I did not know exist.
then a chaser to the whole idea of Apple being behind in AI. From Fortune, a look at how Mark Zuckerberg
made Lama a cornerstone of the AI ambitions at Meta, whose smartphone-era services and products
have always been constrained by Apple and Google, as we know. Quote, while Chad GPT remains the dominant
gen AI tool in the popular imagination, Lama models now power many, if not most, of the
meta products that billions of consumers encounter every day. Meta's AI assistant, which reaches across
Facebook, Instagram, WhatsApp, and Messenger is built with Lama, while users can create their own
AI chatbot with AI studio. Text generation tools for advertisers are built on Lama. Lama helps
power the conversational assistant that is part of META's hit Rayban glasses and the feature in
the Quest headset that lets users ask questions about their surroundings. The company is said to be
developing its own AI-powered search engine and outside its wall.
Lama models have been downloaded over 600 million times on sites like open source AI community
hugging face. Still, the pivot has perplexed many meta-watchers. The company has spent billions
to build the Lama models. On its third quarter earnings column, it announced that it projects
capital expenditures for 2024 to reach as high as $40 billion, with a, quote, significant increase,
likely in 2025. Meanwhile, it's giving Lama away for free to thousands of companies,
including giants like Goldman Sachs, AT&T, and Accenture.
Some investors are struggling to understand where and when exactly meta's revenue would start to justify the eye-watering spend.
Nonetheless, Lama's contrarian success has allowed Zuckerberg to shrug off the lukewarm response to his metaverse ambitions and the company's painful year of efficiency in late 2022 and early 2023.
The rise of Lama has also given Zuckerberg a chance to address a long, simmering, sore point in his otherwise meteoric career.
The fact that Facebook and now META have so often seen their services and products constrained,
by rules imposed by Apple and Google, the rival giants whose app stores are Meta's primary
points of distribution in the mobile device era. As he wrote in a July blog post, we must ensure that
we always have access to the best technology and that we're not locked into a competitor's
closed ecosystem where they can restrict what we build. With Lama, Mena and Zuckerberg have the
chance to set a new industry standard. I think we're going to look back at Lama 3.1 as an
inflection point in the industry where open source AI started to become the industry
just like Linux is, he said on Meta's July earnings call, invoking the open source project
that disrupted the dominance of proprietary operating systems like Microsoft Windows, end
quote. And from the Financial Times, a long read, and I mean a long one, but a very,
very interesting, detailed look at how AI, as we've discussed before, might be the technology
that unlocks robotics at scale, quote, the extraordinary progress in generating text and
images using AI over the past two years have been accomplished by the invention of large language
models, the systems underpinning chatbots. Roboticists are now building on these and their cousins
visually conditioned language models, sometimes called vision language models, which connect
textual information and imagery. With access to huge existing troves of text and image data,
researchers can pre-train their robot models on the nuances of the physical world and how humans
describe it, even before they begin to teach their machine students' specific actions.
But in the chaotic, ever-changing real world, machines not only need to be able to execute individual tasks such as these,
they also need to carry out a multitude of jobs in different settings.
Those at the heart of robotics believe the answer to this generalization is to be found in foundation models for the physical world,
which will draw on growing databases relating to movement banks of information recording robot actions.
The hope is these large behavioral models, once big enough, will help machines adapt to new,
and unpredictable environments, such as commercial and domestic settings, rapidly transforming
business and our home lives. But these models face numerous challenges beyond what is required
for linguistic generative AI. They have to drive actions that obey the laws of physics in a three-dimensional
world and adapt to dynamic environments occupied by other living things. The current challenge for
developing these large behavioral models is the scarcity of data, a difficulty also facing
large language models as a result of human information sources being exhausted, but a great
communal effort is underway among the robotics community to generate new sets of training data.
We've been seeing more data be available, including data for very dexterous tasks, and seeing a lot
of the fruits of putting that data in, says Stanford's Finn. It suggests that if we are able to scale
things up further, then we might be able to make significant breakthroughs in allowing robots
to be successful in real-world environments. Click through on this one, because tons of graphics
are in there to explain everything.
The weekend bonus episode this weekend is me on another podcast, the Daily Detroit
podcast.
I'm sharing this one because the topic we discuss is something that I've not really been
able to get into on this show, the how and why I think it's so hard to seed new tech
hubs in places beyond Silicon Valley.
And crucially, what I think regions should do to actually make that happen.
Listen, enjoy, talk to you on Monday.
