Tech Brew Ride Home - Thu. 11/21 – What The DOJ Wants Google To Do

Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to the Tech meme right home for Thursday, November 21st, 2024, I'm Brian McCullough today. The DOJ has filed its remedy for Google, but what would it mean for end users if their recommendations actually come to pass? InVIDIA's earnings

Starting point is 00:00:47 continue to be historic, but are they worried about current AI models hitting a wall? How AI might help make quantum computing become reality and did a major new AI player just release its first product? Here's what you miss today in the world of tech. It's officially official. The DOJ has asked the federal judge to force Google to sell Chrome, restrict Android from favoring Google's search engine, ban default search deals on iOS and other devices, require Google to allow websites more ability to opt out of its AI products, and provide more ad placement controls to advertisers and syndicate its search results to rival search engines for at least a decade. Google says the DOJ's, quote, wildly overbroad proposal goes miles beyond the court's

Starting point is 00:01:35 decision, end quote, would hurt U.S. consumers and jeopardize U.S. global tech leadership. Quoting the Associated Press. Although regulators stopped short of demanding Google sell Android 2, they asserted the judge should make it clear the company could still be required to divest its smartphone operating system if its oversight committee continues to see evidence of misconduct. A sale of Chrome, quote, will permanently stop Google's control of this critical search access point and allow rival search engines the ability to access the browser that for many users is a gateway to the internet. Justice Department lawyers argued in their filing. The Justice Department decision makers, who will inherit the case after President-elect Donald Trump takes office next year,

Starting point is 00:02:16 might not be as strident. The Washington, D.C. court hearings on Google's punishment are scheduled to begin in April, and Judge Meta is aiming to issue his final decision before Labor Day. If Meta embraces the government's recommendations, Google would be forced to sell its 16-year-old Chrome browser within six months of the final ruling. But the company certainly would appeal any punishment, potentially prolonging a legal tussle that has dragged on for more than four years. Besides seeking a Chrome spin-off and corraling of the Android software, the Justice Department wants the judge to ban Google from forging multi-billion-dollar deals to lock in its dominant search engine as the default option on Apple's iPhone and other devices. It would also ban Google from

Starting point is 00:02:57 favoring its own services, such as YouTube or its recently launched artificial intelligence platform Gemini. Regulators also want Google to license the search index data it collects from people's queries to its rivals, giving them a better chance at competing with the tech giant. On the commercial side of its search engine, Google would be required to provide more transparency into how it sets the prices that advertisers pay to be listed near the top of some targeted search results, end quote. Instead of going into further details about this remedy, I looked around for a piece describing what this might mean for users if it were to actually come to pass. Here you go, quoting INews.

Starting point is 00:03:36 If it were to go ahead and it remains a big if, as Google could still make the case against it, users may well see more friction in how easily they can access services like Gmail, Google's email service through a web browser or Google Drive. Any potential change in ownership could impact how Google tracks web browsers and collects data that is the lifeblood of its business. This balance between data collection and privacy is something a new owner would have to navigate with unknown impacts on how it would operate. A new owner could also change a host of product features from the look and feel of the browser to the way results are displayed or the way

Starting point is 00:04:09 businesses advertise on it. For advertisers, the ability to see in as much detail what users are interested in would be affected by the unbundling of Chrome from Google. If it is ultimately sold to a smaller company, it also raises the question of whether new owners would have the resources to continue to invest in the product and whether the user experience might degrade or be overtaken by rivals. The U.S. Government Department stopped short of suggesting Google should divest its Android operating system, which would have huge effects on everyday users, but it's likely that is being held back as a stick to use should Google not comply with the current decision. Google has said it will be filing its own counter-proposal to the judge's decision by year-end

Starting point is 00:04:47 and will be making its case against the decision in 2025. The big unanswered question from the judge's blockbuster decision this week is who exactly would be able to buy the Chrome browser and what would happen as a result. Amazon and OpenAI, the makers of ChatGPT, have both been touted as potential suitors with enough money to buy the browser, which has been valued by some as worth $20 billion, but neither seems likely. Amazon is facing its own antitrust investigation, with the purchase of the leading web browser unlikely to help its own case, that it does not hold an unfair market dominance, while OpenAI is the leader in generative AI, and as some, could see it argued that they too would wield too much power post-purchase, end quote.

Starting point is 00:05:34 And this is another thing that we've spoken about recently, I think even earlier this week. The CFPB will supervise tech companies offering digital wallets. The likes of Apple Pay, Google Pay, and Venmo, they'll treat companies with more than 50 million annual transactions now as banks, quoting Bloomberg. The original proposal set the supervision threshold at 5 million transactions, a year, while the financial regulator can already take action against companies that break the law, the new rule would allow the CFPB to regularly supervise the large digital wallet and payments firms and their practices. More consumers are turning to digital wallets and payment apps to

Starting point is 00:06:12 complete everyday transactions and competition in the area has intensified with Apple Pay leading the pack. Digital wallet use among U.S. consumers jumped to 62% last year from around 47% in 2022, according to Federal Reserve surveys. Since the The CFPB proposed the rule last year. Apple opened up use of its near-field communication payment chip, changing its long-held practice of limiting banks or other payment firms' use of the technology. The strategy shift followed a deal with European Union financial regulators that required the Cupertino-California-based company to provide free access to its wallet technology for a decade. PayPal Holdings recently disclosed that it's working with the

Starting point is 00:06:51 CFPB to answer questions about backup payment options in its own digital wallet. The final rule will take effect 30 days after it's officially published in the Federal Register, end quote. Vida earnings continue to be historic Q3 revenue was up 94% year over year to $35.1 billion above estimates by about $2 billion. Data Center revenue was up 112% to $30.8 billion, and crucially, NVIDIA forecast Q4 revenue above estimates. For our purposes, what I found interesting was Jensen Huang saying, the current AI paradigm isn't broken, quoting TechCrunch. On its earnings call, analyst prodded CEO Jensen Huang about how Nvidia would fare if tech companies started using new methods to improve

Starting point is 00:07:45 their AI models. The method that underpins OpenAI's 01 model or time test scaling came up quite a lot. It's the idea that AI models will give better answers if you give them more time and computing power to, quote, think through questions. Specifically, it adds more compute to the AI inference phase, which is everything that happens after a user hits enter on their prompt. NVIDIA's CEO was asked whether he was seeing AI model developers shift over to these new methods and how NVIDIA's older chips would work for AI inference. Huang indicated that 01 and test time scaling more broadly could play a larger role in NVIDIA's business moving forward, calling it, quote, one of the most exciting developments and a new scaling law. Huang did his best

Starting point is 00:08:28 to assure investors that NVIDIA is well positioned for the change. The NVIDIA CEO's remarks aligned with what Microsoft CEO such a Nadella said on stage at a Microsoft event on Tuesday. O-1 represents a new way for the AI industry to improve its models. This is a big deal for the chip industry because it places a greater emphasis on AI inference. While Nvidia's chips are the gold standard for training AI models, there's a broad set of well-funded startups crafting lightning-fast AI inference chips, such as GROC and Cerebrus. It could be a more competitive space for Nvidia to operate in.

Starting point is 00:09:01 Despite recent reports that improvements in generative models are slowing, Huang told analysts that AI model developers are still improving their models by adding more compute and data during the pre-training phase. Anthropic CEO Dario Ammodai also said on Wednesday during an on-stage interview at the Cerebral Valley Summit in San Francisco that he is not seeing a slowdown in model development. Foundation model pre-training scaling is intact and it's continuing, said Huang on Wednesday. As you know, this is an empirical law, not a fundamental physical law, but the evidence is that it continues to scale. What we're learning, however, is that it's not enough, end quote. That's certainly what Nvidia investors wanted to hear since the chipmaker's stock has soared more than 180% in 2024 by selling the AI chips that OpenAI, Google, and Meta train their models on. However, Andresen Horowitz partners and several other AI executives have previously said that these models are already starting to show diminishing returns.

Starting point is 00:09:56 Wang noted that most of Nvidia's computing workloads today are around the pre-training of AI models, not inference. But he attributed that more to where the AI world is today. He said that one day there will simply be more people running AI models, meaning more AI inference will happen. Wang noted that Nvidia is the largest inference platform in the world today, and the company's scale and reliability gives it a huge advantage compared to startups. Our hopes and dreams are that someday the world does a ton of inference, and that's when

Starting point is 00:10:26 AI has really succeeded, said Huang. Everyone knows that if they innovate on top of Kuda and NVIDIA's architecture, they can innovate more quickly, and they know that everything should work, end quote. Google researchers have introduced Alpha-Cubit, a machine learning decoder that surpasses existing methods for identifying and correcting quantum computer errors. This could be a big deal, quoting Quantum Insider. Quantum computers, which leverage principles like superposition and entanglement are poised to solve specific problems exponentially faster than classical machines according to the post. However, qubits, the building blocks of quantum computers, are highly susceptible to noise leading to frequent errors. Overcoming this vulnerability is critical to scaling

Starting point is 00:11:14 quantum devices for practical applications. To counteract this, quantum error correction uses redundancy. Multiple physical cubits are grouped into a single logical cubit, and consistency checks are performed to detect and correct errors. The challenge lies, in decoding these checks efficiently and accurately, especially as quantum processors scale up. Current hardware typically exhibits error rates of 1 to 10% per operation, far too high for reliable computations. Future systems will require error rates below 0.0000000001% for practical applications like drug discovery materials design and cryptographic tasks. Alpha-Cubit is built on the transformer architecture.

Starting point is 00:11:59 Transformer refers to a type of neural network architecture designed to process sequential data efficiently by, for example, focusing on the most important parts of the data it analyzes. This helps Alpha-Cubit to decode quantum errors accurately. As the name suggests, neural networks are meant to mimic the human brain's neurons, generally speaking. Just like people have to learn before they master a new skill and continually hone that skill, neural networks have to learn and practice too. Alpha-Cubit employs a two-stage training process, pre-training and fine-tuning.

Starting point is 00:12:29 In the pre-training phase, the model is first exposed to synthetic examples generated by a quantum simulator. This enables it to learn general error patterns under various noise conditions. Then the system goes through the fine-tuning. Here the model is further trained on real-world error data from Google's sycamore processor, tailoring it to the specific noise characteristics of the hardware. The decoder adapts to complex error types, including crosstalk, unwinned qubit interactions and leakage, cubits drifting into non-computational states. It also utilizes soft readouts, probabilistic measurements that provide richer information about cubit states.

Starting point is 00:13:04 The team suggests that their success with Alpha Cubits performance represents a significant step forward in the integration of machine learning and quantum computing. By automating the decoding process, the model reduces the reliance on handcrafted algorithms, which often struggle with the complexity of real-world noise, end quote. In other words, AI to make quantum computing possible, possibly. Sources are telling the journal that XAI has told investors it raised $5 billion in a funding round valuing it at $50 billion, and that its revenue has reached $100 million on an annualized basis. Quote, XAI was valued at $24 billion when it raised $6 billion in the spring. XAI's primary product is its Grock Chatbot available to premium subscribers of Musk's social network X.

Starting point is 00:13:56 The company also recently made Grok available to business customers. GROC was launched in November 2023, making it late to a race with competitors including OpenAI, Anthropic, and Alphabet's Google. XAI spent this past summer constructing a data center in Memphis, Tennessee that houses 100,000 Nvidia chips for building its AI models. Musk has said the Memphis Data Center contains the most powerful AI cluster in the world and that he is planning to double its size. Musk is particularly focused on beating OpenAI, the chat GPT creator he co-founded in 2015.

Starting point is 00:14:26 He has sued the startup and its chief executive Sam Altman for alleged fraud and antitrust violations, claims Open AI have called baseless. XAI is set to debut the third version of its GROC language model in December. Musk has said it will be, quote, the world's most powerful AI by every metric, end quote. And finally today, an interesting raise, because it might signal a significant new player in the AI space. French startup H has announced its first product, Runner H, H, an agentic, AI model available in private beta. H raised a lot of eyebrows when it raised a $220 million seed round back in May.

Starting point is 00:15:09 You heard that right, $220 million seed round. Quoting TechCrunch. Runner H is built atop the startup's own proprietary compact LLM based on just 2 billion parameters. H has set up a wait list for Runner H on its site. CEO Charles Cantor said that it will be releasing APIs to those on the list over the coming days to use agents off-the-shelf that have been pre-built by H as well as for developers to create their own. Access to the API will also come along with access to something called H-studio to test and manage how these services work. Initially, using those APIs will be free and later there

Starting point is 00:15:45 will be a payment model introduced. Even using compact LLMs, building and running AI is not cheap, especially as competition continues to raise money to develop their own products. TechCrunch has also confirmed that H is raising a series A to build what Cantor describes as part of the second era of AI, with LLM companies like OpenAI being part of the first era. Runner H will initially focus on three specific use cases, robotic process automation or RPA, quality assurance, and business process outsourcing. RPA is an area that has existed for years using basic scripts to automate the most repetitive tasks that humans have had to perform, such as reading forms, checking boxes, and sending files from one place to another. In fact, a lot of

Starting point is 00:16:28 RPA has never been built with AI baked in even after AI started to develop advanced skills. The idea with Rner H is that it will be able to run RPA across forms, sites, and other templates, even when they have been modified, something that might have broken previous scripts, and across a much wider range of sources. Quality assurance can cover a wide range of applications, but Cantor said that one of the most popular so far has been reducing the maintenance burdens around website testing, validating page availability, simulating real user actions, or ensuring compatibility across payment methods, in particular when modifications have been made.

Starting point is 00:17:04 BPO is a catch-all area that will cover not just fixing and improving billing processes, but also speeding up how an agent can use and access data from different sources and more. There has been a race among foundational AI companies around how many parameters are going into LLMs. GPT4, for example, has 175 billion parameters, but runner H is taking a very different approach with just two billion parameters, both for its LLM and for its computer vision-based VLM. Cantor's argument is that this makes them significantly more efficient in terms of costs and operations key when working on winning and keeping business deals

Starting point is 00:17:38 and H's own operational costs. We are specialists, he said. We are building for the agentic era. The company also claims that it works. It says that its compact model outperforms Anthropics, computer use by 29% based on Web Voyager benchmarks, as well as models from Mistral and meta, end quote. Nothing more for you today. Talk to you tomorrow.

Tech Brew Ride Home - Thu. 11/21 – What The DOJ Wants Google To Do

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.