Tech Brew Ride Home - Thu. 11/21 – What The DOJ Wants Google To Do
Episode Date: November 21, 2024The DOJ has filed its “remedy” for Google but what would it mean for end users if their recommendations actually come to pass? Nvidia’s earnings continue to be historic but are they worried abou...t current AI models hitting a wall? How AI might help make quantum computing become reality. And did a major new AI player just release its first product? Sponsors: Lumen.me/ride Links: US regulators seek to break up Google, forcing Chrome sale as part of monopoly punishment (Associated Press) Google could be forced to sell Chrome – here’s what it would mean for users (iNews) Apple Pay, Other Tech Firms Come Under CFPB Regulatory Oversight (Bloomberg) Nvidia’s CEO defends his moat as AI labs change how they improve their AI models (TechCrunch) AI Power For Quantum Errors: Google Develops AlphaQubit to Identify, Correct Quantum Errors (Quantum Insider) Elon Musk’s xAI Startup Is Valued at $50 Billion in New Funding Round (WSJ) H, the AI startup that raised $220M, launches its first product: Runner H for ‘agentic’ applications (TechCrunch) Learn more about your ad choices. Visit megaphone.fm/adchoices
Transcript
Discussion (0)
On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco.
Hey, who did this to you?
What happened next turned the story into a political firestorm.
Reports have identified the victim as Bob Lee, the founder of Cash App.
From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16.
Welcome to the Tech meme right home for Thursday, November 21st,
2024, I'm Brian McCullough today. The DOJ has filed its remedy for Google, but what would
it mean for end users if their recommendations actually come to pass? InVIDIA's earnings
continue to be historic, but are they worried about current AI models hitting a wall? How
AI might help make quantum computing become reality and did a major new AI player just release
its first product? Here's what you miss today in the world of tech. It's officially official. The
DOJ has asked the federal judge to force Google to sell Chrome, restrict Android from favoring Google's
search engine, ban default search deals on iOS and other devices, require Google to allow
websites more ability to opt out of its AI products, and provide more ad placement controls to
advertisers and syndicate its search results to rival search engines for at least a decade.
Google says the DOJ's, quote, wildly overbroad proposal goes miles beyond the court's
decision, end quote, would hurt U.S. consumers and jeopardize U.S. global tech leadership.
Quoting the Associated Press. Although regulators stopped short of demanding Google sell Android 2,
they asserted the judge should make it clear the company could still be required to divest its
smartphone operating system if its oversight committee continues to see evidence of misconduct.
A sale of Chrome, quote, will permanently stop Google's control of this critical search access point
and allow rival search engines the ability to access the browser that for many users is a gateway to the internet.
Justice Department lawyers argued in their filing. The Justice Department decision makers,
who will inherit the case after President-elect Donald Trump takes office next year,
might not be as strident. The Washington, D.C. court hearings on Google's punishment
are scheduled to begin in April, and Judge Meta is aiming to issue his final decision before Labor Day.
If Meta embraces the government's recommendations, Google would be forced to sell its 16-year-old
Chrome browser within six months of the final ruling. But the company certainly would appeal any
punishment, potentially prolonging a legal tussle that has dragged on for more than four years.
Besides seeking a Chrome spin-off and corraling of the Android software, the Justice Department
wants the judge to ban Google from forging multi-billion-dollar deals to lock in its dominant search
engine as the default option on Apple's iPhone and other devices. It would also ban Google from
favoring its own services, such as YouTube or its recently launched artificial intelligence
platform Gemini. Regulators also want Google to license the search index data it collects from
people's queries to its rivals, giving them a better chance at competing with the tech giant.
On the commercial side of its search engine, Google would be required to provide more transparency
into how it sets the prices that advertisers pay to be listed near the top of some targeted search
results, end quote. Instead of going into further details about this remedy,
I looked around for a piece describing what this might mean for users if it were to actually come to pass.
Here you go, quoting INews.
If it were to go ahead and it remains a big if, as Google could still make the case against it,
users may well see more friction in how easily they can access services like Gmail,
Google's email service through a web browser or Google Drive.
Any potential change in ownership could impact how Google tracks web browsers and collects data
that is the lifeblood of its business.
This balance between data collection and privacy is something a new owner would have to navigate
with unknown impacts on how it would operate. A new owner could also change a host of product
features from the look and feel of the browser to the way results are displayed or the way
businesses advertise on it. For advertisers, the ability to see in as much detail what users are
interested in would be affected by the unbundling of Chrome from Google. If it is ultimately
sold to a smaller company, it also raises the question of whether new owners would have the
resources to continue to invest in the product and whether the user experience might degrade or be
overtaken by rivals. The U.S. Government Department stopped short of suggesting Google should divest
its Android operating system, which would have huge effects on everyday users, but it's likely
that is being held back as a stick to use should Google not comply with the current decision.
Google has said it will be filing its own counter-proposal to the judge's decision by year-end
and will be making its case against the decision in 2025. The big unanswered question from the
judge's blockbuster decision this week is who exactly would be able to buy the Chrome browser
and what would happen as a result. Amazon and OpenAI, the makers of ChatGPT, have both been touted
as potential suitors with enough money to buy the browser, which has been valued by some as worth
$20 billion, but neither seems likely. Amazon is facing its own antitrust investigation,
with the purchase of the leading web browser unlikely to help its own case, that it does
not hold an unfair market dominance, while OpenAI is the leader in generative AI, and as some,
could see it argued that they too would wield too much power post-purchase, end quote.
And this is another thing that we've spoken about recently, I think even earlier this week.
The CFPB will supervise tech companies offering digital wallets.
The likes of Apple Pay, Google Pay, and Venmo, they'll treat companies with more than 50 million
annual transactions now as banks, quoting Bloomberg.
The original proposal set the supervision threshold at 5 million transactions,
a year, while the financial regulator can already take action against companies that break the law,
the new rule would allow the CFPB to regularly supervise the large digital wallet and payments
firms and their practices. More consumers are turning to digital wallets and payment apps to
complete everyday transactions and competition in the area has intensified with Apple Pay leading
the pack. Digital wallet use among U.S. consumers jumped to 62% last year from around 47% in
2022, according to Federal Reserve surveys. Since the
The CFPB proposed the rule last year. Apple opened up use of its near-field communication
payment chip, changing its long-held practice of limiting banks or other payment firms' use
of the technology. The strategy shift followed a deal with European Union financial regulators
that required the Cupertino-California-based company to provide free access to its wallet
technology for a decade. PayPal Holdings recently disclosed that it's working with the
CFPB to answer questions about backup payment options in its own digital wallet. The final rule will
take effect 30 days after it's officially published in the Federal Register, end quote.
Vida earnings continue to be historic Q3 revenue was up 94% year over year to $35.1 billion
above estimates by about $2 billion. Data Center revenue was up 112% to $30.8 billion,
and crucially, NVIDIA forecast Q4 revenue above estimates. For our purposes, what I found
interesting was Jensen Huang saying,
the current AI paradigm isn't broken, quoting TechCrunch. On its earnings call, analyst prodded CEO
Jensen Huang about how Nvidia would fare if tech companies started using new methods to improve
their AI models. The method that underpins OpenAI's 01 model or time test scaling came up
quite a lot. It's the idea that AI models will give better answers if you give them more time and
computing power to, quote, think through questions. Specifically, it adds more compute to the AI
inference phase, which is everything that happens after a user hits enter on their prompt.
NVIDIA's CEO was asked whether he was seeing AI model developers shift over to these new methods
and how NVIDIA's older chips would work for AI inference. Huang indicated that 01 and test time
scaling more broadly could play a larger role in NVIDIA's business moving forward, calling
it, quote, one of the most exciting developments and a new scaling law. Huang did his best
to assure investors that NVIDIA is well positioned for the change. The NVIDIA
CEO's remarks aligned with what Microsoft CEO such a Nadella said on stage at a Microsoft event on Tuesday.
O-1 represents a new way for the AI industry to improve its models.
This is a big deal for the chip industry because it places a greater emphasis on AI inference.
While Nvidia's chips are the gold standard for training AI models,
there's a broad set of well-funded startups crafting lightning-fast AI inference chips,
such as GROC and Cerebrus.
It could be a more competitive space for Nvidia to operate in.
Despite recent reports that improvements in generative models are slowing, Huang told analysts that AI model developers are still improving their models by adding more compute and data during the pre-training phase.
Anthropic CEO Dario Ammodai also said on Wednesday during an on-stage interview at the Cerebral Valley Summit in San Francisco that he is not seeing a slowdown in model development.
Foundation model pre-training scaling is intact and it's continuing, said Huang on Wednesday.
As you know, this is an empirical law, not a fundamental physical law, but the evidence is that it continues to scale.
What we're learning, however, is that it's not enough, end quote.
That's certainly what Nvidia investors wanted to hear since the chipmaker's stock has soared more than 180% in 2024
by selling the AI chips that OpenAI, Google, and Meta train their models on.
However, Andresen Horowitz partners and several other AI executives have previously said that these models are already starting to show diminishing returns.
Wang noted that most of Nvidia's computing workloads today are around the pre-training of AI models,
not inference.
But he attributed that more to where the AI world is today.
He said that one day there will simply be more people running AI models, meaning more AI
inference will happen.
Wang noted that Nvidia is the largest inference platform in the world today, and the company's
scale and reliability gives it a huge advantage compared to startups.
Our hopes and dreams are that someday the world does a ton of inference, and that's when
AI has really succeeded, said Huang. Everyone knows that if they innovate on top of Kuda and
NVIDIA's architecture, they can innovate more quickly, and they know that everything should work,
end quote. Google researchers have introduced Alpha-Cubit, a machine learning decoder that
surpasses existing methods for identifying and correcting quantum computer errors. This could be a big
deal, quoting Quantum Insider. Quantum computers, which leverage principles like superposition
and entanglement are poised to solve specific problems exponentially faster than classical machines
according to the post. However, qubits, the building blocks of quantum computers, are highly
susceptible to noise leading to frequent errors. Overcoming this vulnerability is critical to scaling
quantum devices for practical applications. To counteract this, quantum error correction
uses redundancy. Multiple physical cubits are grouped into a single logical cubit,
and consistency checks are performed to detect and correct errors. The challenge lies,
in decoding these checks efficiently and accurately, especially as quantum processors scale up.
Current hardware typically exhibits error rates of 1 to 10% per operation, far too high for reliable
computations. Future systems will require error rates below 0.0000000001% for practical applications
like drug discovery materials design and cryptographic tasks.
Alpha-Cubit is built on the transformer architecture.
Transformer refers to a type of neural network architecture designed to process sequential data efficiently
by, for example, focusing on the most important parts of the data it analyzes.
This helps Alpha-Cubit to decode quantum errors accurately.
As the name suggests, neural networks are meant to mimic the human brain's neurons, generally
speaking.
Just like people have to learn before they master a new skill and continually hone that skill,
neural networks have to learn and practice too.
Alpha-Cubit employs a two-stage training process, pre-training and fine-tuning.
In the pre-training phase, the model is first exposed to synthetic examples generated by a quantum simulator.
This enables it to learn general error patterns under various noise conditions.
Then the system goes through the fine-tuning.
Here the model is further trained on real-world error data from Google's sycamore processor,
tailoring it to the specific noise characteristics of the hardware.
The decoder adapts to complex error types, including crosstalk,
unwinned qubit interactions and leakage, cubits drifting into non-computational states. It also utilizes
soft readouts, probabilistic measurements that provide richer information about cubit states.
The team suggests that their success with Alpha Cubits performance represents a significant step forward
in the integration of machine learning and quantum computing. By automating the decoding process,
the model reduces the reliance on handcrafted algorithms, which often struggle with the complexity of
real-world noise, end quote. In other words,
AI to make quantum computing possible, possibly.
Sources are telling the journal that XAI has told investors it raised $5 billion in a funding round valuing it at $50 billion, and that its revenue has reached $100 million on an annualized basis.
Quote, XAI was valued at $24 billion when it raised $6 billion in the spring.
XAI's primary product is its Grock Chatbot available to premium subscribers of Musk's social network X.
The company also recently made Grok available to business customers.
GROC was launched in November 2023, making it late to a race with competitors including
OpenAI, Anthropic, and Alphabet's Google.
XAI spent this past summer constructing a data center in Memphis, Tennessee that houses
100,000 Nvidia chips for building its AI models.
Musk has said the Memphis Data Center contains the most powerful AI cluster in the world
and that he is planning to double its size.
Musk is particularly focused on beating OpenAI, the chat GPT creator he co-founded in 2015.
He has sued the startup and its chief executive Sam Altman for alleged fraud and antitrust violations,
claims Open AI have called baseless.
XAI is set to debut the third version of its GROC language model in December.
Musk has said it will be, quote, the world's most powerful AI by every metric, end quote.
And finally today, an interesting raise, because it might signal a significant new player in the AI space.
French startup H has announced its first product, Runner H, H, an agentic,
AI model available in private beta.
H raised a lot of eyebrows when it raised a $220 million seed round back in May.
You heard that right, $220 million seed round.
Quoting TechCrunch.
Runner H is built atop the startup's own proprietary compact LLM based on just 2 billion parameters.
H has set up a wait list for Runner H on its site.
CEO Charles Cantor said that it will be releasing APIs to those on the list over the coming
days to use agents off-the-shelf that have been pre-built by H as well as for developers to create
their own. Access to the API will also come along with access to something called H-studio to test
and manage how these services work. Initially, using those APIs will be free and later there
will be a payment model introduced. Even using compact LLMs, building and running AI is not cheap,
especially as competition continues to raise money to develop their own products. TechCrunch has
also confirmed that H is raising a series A to build
what Cantor describes as part of the second era of AI, with LLM companies like OpenAI being part of the
first era. Runner H will initially focus on three specific use cases, robotic process automation or
RPA, quality assurance, and business process outsourcing. RPA is an area that has existed for years
using basic scripts to automate the most repetitive tasks that humans have had to perform,
such as reading forms, checking boxes, and sending files from one place to another. In fact, a lot of
RPA has never been built with AI baked in even after AI started to develop advanced skills.
The idea with Rner H is that it will be able to run RPA across forms, sites, and other templates,
even when they have been modified, something that might have broken previous scripts,
and across a much wider range of sources.
Quality assurance can cover a wide range of applications, but Cantor said that one of the most popular so far
has been reducing the maintenance burdens around website testing,
validating page availability, simulating real user actions, or ensuring compatibility across payment methods,
in particular when modifications have been made.
BPO is a catch-all area that will cover not just fixing and improving billing processes,
but also speeding up how an agent can use and access data from different sources and more.
There has been a race among foundational AI companies around how many parameters are going into LLMs.
GPT4, for example, has 175 billion parameters,
but runner H is taking a very different approach with just two billion parameters,
both for its LLM and for its computer vision-based VLM.
Cantor's argument is that this makes them significantly more efficient
in terms of costs and operations key when working on winning and keeping business deals
and H's own operational costs.
We are specialists, he said.
We are building for the agentic era.
The company also claims that it works.
It says that its compact model outperforms Anthropics,
computer use by 29% based on Web Voyager benchmarks, as well as models from Mistral and
meta, end quote. Nothing more for you today. Talk to you tomorrow.
