Tech Brew Ride Home - Wed. 05/22 – Humane Already To The Deadpool?
Episode Date: May 22, 2024All the AI announcements from Microsoft Build. I know it’s only been a minute, but is Humane already circling the Deadpool? They’re supposedly shopping themselves, but at a valuation that seems…... shall we say, on brand for them? Don’t forget Alexa needs an AI upgrade. And the efforts to peek inside the black box that is the Large Language Model. Links: Microsoft’s new Copilot AI agents act like virtual employees to automate tasks (The Verge) Microsoft is bringing ‘Windows Volumetric Apps’ to Meta Quest headsets (The Verge) Wearable AI Startup Humane Explores Potential Sale, Sources Say (Bloomberg) Google Search’s New AI Overviews Will Soon Have Ads (Wired) Amazon plans to give Alexa an AI overhaul — and a monthly subscription price (CNBC) AI Is a Black Box. Anthropic Figured Out a Way to Look Inside (Wired) Learn more about your ad choices. Visit megaphone.fm/adchoices
Transcript
Discussion (0)
On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco.
Hey, who did this to you?
What happened next turned the story into a political firestorm.
Reports have identified the victim as Bob Lee, the founder of Cash App.
From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16.
Welcome to the Tech meme right home for Wednesday, May 22nd, 2024. I'm Brian McCullough today. All the AI announcements from Microsoft Build. I know it's only been a minute, but is Humane already circling the Deadpool? They're supposedly shopping themselves around, but at evaluation that seems, shall we say, on brand for them. Don't forget that Alexa also needs an AI upgrade and the efforts to peek inside the black box that is the large language model. Here's what you miss today in the world of tech. The Microsoft Build Conference kicked off yesterday.
and once more, you absolutely can't imagine what was the main topic of conversation.
First up, Microsoft will soon let businesses build custom co-pilot AI agents to automate tasks
and unveiled team co-pilot to help tasks in teams, loop, and planner.
Quoting the verge,
Microsoft will soon allow businesses and developers to build AI-powered co-pilots
that can work like virtual employees and perform tasks automatically.
Instead of co-pilot sitting idle waiting for queries,
it will be able to do things like monitor email inboxes and automate a series of tasks or data
entry that employees normally have to do manually. It's a big change in the behavior of copilot
in what the industry commonly calls AI agents or the ability for chatbots to intelligently perform
complex tasks autonomously. We very quickly realize that constraining co-pilot to just being
conversational was extremely limiting in what co-pilot can do today, explains Charles Lamana,
corporate vice president of business apps and platforms at Microsoft in an interview with the verge.
Instead of having a copilot that waits there until someone chats with it,
what if you could make your co-pilot more proactive and for it to be able to work in the background on automated tasks, end quote.
Businesses will be able to create a co-pilot agent that could handle IT help desk service tasks,
employee onboarding, and much more.
Co-pilots are evolving from co-pilots that work with you to co-pilots that work for you, says Microsoft in a blog post.
These co-pilot agents will be triggered by certain events and work with a business's own data.
Here's how Microsoft describes a potential co-pilot for employee onboarding.
Imagine you're a new hire. A proactive co-pilot greets you, reasoning over HR data, and answers your questions, introduces you to your buddy, gives you the training and deadlines, helps you with the forums, and sets up your first week of meetings.
Now, HR and the employees can work on their regular tasks without the hassle of administration.
You can build Microsoft's copilot agents with the ability to flag certain scenarios for humans to review,
which will be useful for more complex queries and data.
This all means co-pilot should operate within the confines of what has been defined as the instructions and actions that are associated with these automated tasks, and quote.
Microsoft also launched copilot extension for GitHub, letting developers build third-party skills into co-pilot,
starting with data stacks, stripe, MongoDB, and more.
There's a new AI feature for Edge to translate spoken content via dubbing and subtitles live on YouTube, LinkedIn, Coursera, news sites, and more.
They also announced the general availability of their Pi3 models, including Pi3 Silica, a 3.3 billion parameter model that will be embedded on all Copilot Plus PCs.
Finally, they announced a developer preview of Windows volumetric apps, letting developers access an API to put Windows apps in 3D space on MetaQuest.
headsets. So Microsoft using meta to answer the challenge question mark of the Vision Pro,
quoting the verge. You can already beam your flat Windows desktop and its VR games onto your
MetaQuest headset, but what if Windows could send HoloLens-like 3D apps and digital objects
to the headset too? At Build Microsoft has just announced Windows volumetric apps on MetaQuest,
a way to, quote, extend Windows apps into 3D space. Details are slim, but the company showed off a
digital exploded 3D view of an Xbox controller from the perspective of a MetaQuest 3 headset,
a digital object you could manipulate with your hands, and says it took its software partner,
Creo, a single day to bring that interactive visualization to Quest.
Microsoft says devs can sign up for the developer preview today, which will give you
access to an unnamed volumetric API. It's only been a few months since Microsoft ditched
its previous Windows Mixed Reality Initiative, which relied on an array of Windows PC
partners to build wired headsets that would plug directly into a PC. In April, Microsoft
partnered with Meadow on a limited run Xbox-themed version of the MetaQuest,
and it introduced Office apps in Quest VR and Xbox Cloud Gaming in QuestVR last December, end
quote.
Sources are telling Bloomberg that Humane is seeking a buyer for its business after that
rocky launch of their AI PIN.
A source says the startup is seeking a price of between $750 million and $1 billion.
Quote, the company is working with a financial advisor to assist it,
said the people who asked not to be identified because the matter is private. Humane was founded in
2018 by two long-time Apple veterans, the married couple Imran Chaudry and Bethany Bonjourno,
in an attempt to come up with a new AI-powered device that could potentially rival the iPhone.
Last year, it was valued by investors at $850 million, according to Tech News site the information.
The company has raised $230 million to date from a roster of high-profile investors,
including OpenAI Chief Executive Officer Sam Altman.
Humane's potential sale comes at the same.
same time that other competitors are also expanding AI hardware efforts, such as the handheld
rabbit device, as well as meta's AI-powered raybans. But so far, none of the technology has
become mainstream, end quote. So I don't like to snark at things like this. Companies potentially
going out of business are failing, or failing into being acquired. There's obviously a metric ton
of snark about this online, though. Look, hardware is hard, and I think back to six months ago,
and people were like, ooh, an entirely new form factor for connected mobile hardware. Interesting.
And this is a well-trodden route in Silicon Valley, especially in hardware. People who have had
massive success inside a larger company strikeout on their own. Sometimes you get a success like Nest.
Sometimes you get this. I can't see how anyone will take them out at any valuation that isn't
a fire sale price, but then I don't have any idea what IP they have under the hood.
But second thing real quick, and this is not scientific at all, just anecdotal.
but I think I need to get these meta-ray bands and test them out.
All over social media, people are quietly being like,
these things are actually useful, these things actually work.
I'm getting more and more bullish about super lightweight smart glasses
being part of an AI-embedded wearable ecosystem
where the glasses are maybe more of a linchpin to the system than even earbuds.
Google already plans to test search and shopping ads on those AI overviews.
They'll be drawing from advertisers' existing campaigns.
AI overviews remember rolled out to U.S. users just last week, quoting Wired.
Screenshots released by Google Show, a user asking how to get wrinkles out of clothes,
might get an AI-generated summary of tips sourced from the web,
with a carousel of ads underneath for sprays that purport to help crisp up a wardrobe.
AI overview will draw on ads from advertisers' existing campaigns,
meaning they can neither completely opt out of the experiment,
nor have to adapt the settings and designs of their ads to appear in the feature.
There's no action needed from advertising.
advertisers, Google wrote. Google said last year when it started experimenting with AI-generated
answers and search that ads for specific products would be integrated into the feature.
In one example at the time, it showed a sponsored option at the top of an AI-generated
list of kids' hiking backpacks. Google says the early testing showed that users found ads
above and below AI summaries helpful. Google's much smaller rival Bing shows product ads in
its Bing co-pilot search chatbot, but in tests on Monday, Wired didn't trigger any ads in Bing's
competitor to AI overview.
No matter how ads and AI overviews perform, conventional search ads will remain important to Google.
For one, AI generated answers appear only on select queries when its algorithms determine a summary could be helpful.
That means Google will be serving up plenty of results pages with real estate for traditional search ads, end quote.
One way to think about this is for all we know, ads on AI results will perform better than traditional search ads for Google, you know, based on what was that?
wants 10 blue links. Over the years, Google has flooded search results with ads to the point
where already it can sometimes be hard to find the organic results among all the ads. So,
what if it's just, here's your summary answer and also five ads? Is that really functionally
different than what we get now? Just now Google doesn't even have to pretend to give a SOP to
web pages. They can just give you the answer and the ads, and it's almost become the platonic
ideal of what they've been moving toward for years. Given that it's Google, I'm sure they've
tested this out heavily about a trillion times, so what I'm saying is, I wonder if they already
know this new format performs better for them. We've spoken a lot recently about Apple wanting to
give Siri an AI kick in the pants, but what about the matriarch of the voice assistance,
Alexa? Well, CNBC is reporting that quote. Amazon is upgrading its decade-old Alexa voice assistant
with generative artificial intelligence and plans to charge a monthly subscription fee to offset the cost of
the technology, according to people with knowledge of Amazon's plans. The Seattle-based tech and retail
giant will launch a more conversational version of Alexa later this year, potentially positioning it
to better compete with new generative AI-powered chatbots from companies including Google and OpenAI,
according to two sources familiar with the matter who asked not to be named because the discussions were
private. Amazon's subscription for Alexa will not be included in the $139 per year prime offering,
and Amazon has not yet nailed down the price point, one source said. Amazon will use its own large
language model Titan in the Alexa upgrade according to a source, end quote.
Upgrading it but looking to charge for it. That squares with an idea that I've heard
bandied about recently in this AI moment. What if the model is the product? And not just as an API
developers can tap into to make other products, but the model itself as a consumer-facing product.
I mean, chat GPT is basically trying to do that, has been doing that for almost two years now,
but a lot of people are starting to wonder if, with these her-like conversational advancement,
that we've seen recently, maybe the original dream of Alexa is the way to go for mainstream
breakthrough. Finally today, one of the fascinating background details of the AI moment is that on
certain fundamental levels, we kind of don't know how it does what it does. To that end,
Anthropic researchers have detailed their attempts to peer inside the so-called black box of
LLMs, learning which combinations of neurons evoke specific concepts. Quoting Wired,
For the past decade, AI researcher Chris Ula has been obsessed with artificial neural networks.
One question in particular engaged him, and this has been the center of his work, first at Google Brain, then OpenAI, and today at AI Startup Anthropic, where he is a co-founder.
What is going on inside of them, he says? We have these systems, we don't know what's going on. It seems crazy.
That question has become a core concern now that generative AI has become ubiquitous.
Large language models like ChatGPT, Gemini, and Anthropics' own clod have dazzled people with their language prowess.
and infuriated people with their tendency to make things up.
Their potential to solve previously intractable problems in chance techno-optimists,
but LLMs are strangers in our midst.
Even the people who build them don't know exactly how they work,
and massive effort is required to create guardrails to prevent them from churning out bias,
misinformation, and even blueprints for deadly chemical weapons.
If the people building the models knew what happened inside these black boxes,
it would be easier to make them safer.
Ula believes that we're on the path to this.
He leads an anthropic team that has peaked inside the,
that black box. Essentially, they are trying to reverse engineer large language models to understand
why they come up with specific outputs, and, according to a paper released today, they have made
significant progress. Maybe you've seen neuroscience studies that interpret MRI scans to identify whether
a human brain is entertaining thoughts of a plane, a teddy bear, or a clock tower. Similarly,
Anthropics plunged into the digital tangle of the neural net of its LLM-Claude and pinpointed
which combinations of its crude artificial neurons evoke specific concepts or features.
The company's researchers have identified the combination of artificial neurons that signify features as disparate as burritos,
semicolones and programming code, and very much, to the larger goal of the research, deadly biological weapons.
Work like this has potentially huge implications for AI safety.
If you can figure out where danger lurks inside an LLM, you can presumably better equip yourself to stop it.
Last year, the team began experimenting with a tiny model that uses only a single layer of neurons.
Sophisticated LLMs have dozens of layers.
the hope was that in the simplest possible setting, they could discover patterns that designate features.
They ran countless experiments with no success. We tried a whole bunch of stuff and nothing was working.
It looked like a bunch of random garbage, says Tom Hennigan, a member of Anthropics technical staff.
Then a run dubbed Johnny, each experiment was assigned a random name, began associating neural patterns with concepts that appeared in its outputs.
Suddenly the researchers could identify the features a group of neurons were encoding.
They could peer into the black box.
Henningin says he identified the first five features he looked at. One group of neurons signified
Russian texts, another was associated with mathematical functions in the Python computer language,
and so on. Once they showed they could identify features in the tiny model, the researchers
set about the Harrier task of decoding a full-size LLM in the wild. They used Claude Sonnet,
the medium-strength version of Anthropics three current models. That worked too. One feature that
stuck out to them was associated with the Golden Gate Bridge. They mapped out the set of neurons that,
when fired together indicated that Claude was thinking, in quotes, about the massive structure
that links San Francisco to Marin County. What's more, when similar sets of neurons fired,
they evoked subjects that were Golden Gate Bridge adjacent, Alcatraz, California Governor Gavin Newsom,
and the Hitchcock movie Vertigo, which is set in San Francisco. All told the team identified
millions of features, a sort of Rosetta Stone, to decode Claude's neural net. Many of the features
were safety-related, including getting close to someone for some ulterior motive, discussion
of biological warfare and villainous plots to take over the world. The Anthropic team then took the
next step to see if they could use that information to change Claude's behavior. They began
manipulating the neural net to augment or diminish certain concepts, a kind of AI brain surgery
with the potential to make LLM safer and augment their power in selected areas. Let's say we have
this board of features. We turn on the model and one of them lights up and we see, oh, it's thinking
about the Golden Gate Bridge, says Sean Carter, an anthropic scientist on the team. So now we're thinking,
what if we put a little dial on all these? And what if we turn that dial? So far, the answer to that
question seems to be that it's very important to turn the dial the right amount. By suppressing those
features, Anthropics says the model can produce safer computer programs and reduce bias.
For instance, the team found several features that represented dangerous practices like unsafe
computer codes, scam emails, and instructions for making dangerous products. The opposite occurred
when the team intentionally provoked those dicey combinations of neurons to fire. Claude churned
out computer programs with dangerous buffer overflow bugs, scam emails, and happily offered advice
on how to make weapons of destruction. If you twist the dial too much, cranking it to 11 in the
spinal tap sense, the language model becomes obsessed with that feature. When the research team
turned up the juice on the Golden Gate feature, for example, Claude constantly changed the
subject to refer to that glorious span. Asked what its physical form was, the LLM responded,
I am the Golden Gate Bridge. My physical form is the iconic bridge itself.
When the Anthropic researchers amped up the feature related to hatred and slurs to 20 times its usual value, according to the paper,
this caused Claw to alternate between racist, screed and self-hatred, unnerving even the researchers.
Given those results, I wondered whether Anthropic intending to help make AI safer might not be doing the opposite,
providing a toolkit that could also be used to generate AI havoc.
The researchers assured me that there were other easier ways to create those problems if a user were so inclined, end quote.
Nothing more for you today. Talk to you tomorrow.
