Tech Brew Ride Home - Wed. 11/20 – AI To Read The Books

Episode Date: November 20, 2024

Now authors are being approached about training AI on their books, and some are not pleased. The new Android development cadence is here. More signs crypto is ascendant. More signs that Bluesky has ta...ken off. And a case in point for why governments and militaries are worried about smartphone tracking. Links: Microsoft Signs AI-Learning Deal With News Corp.’s HarperCollins (Bloomberg) Penguin Random House underscores copyright protection in AI rebuff (The Bookseller) The first Android 16 developer preview just landed: Here’s what you need to know (Android Police) Sony’s new PlayStation Portal update lets you stream PS5 games from the cloud (The Verge) Howard Lutnick, Tether's Wall Street Banker, Is Trump's Pick for Commerce Chief, Not Treasury Secretary (CoinDesk) Bluesky tops 20M users, narrowing gap with Instagram Threads (TechCrunch) Anyone Can Buy Data Tracking US Soldiers and Spies to Nuclear Vaults and Brothels in Germany (Wired) Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to the TechMeme right home for Wednesday, November 20th, 2020. I'm Brian McCullough today.
Starting point is 00:00:39 Now authors are being approached about training AI on their books, and some are not pleased. The new Android development cadence is here. More signs crypto is ascendant, more signs that blue sky has taken off, and a case in point for why governments and militaries are worried about smartphone tracking. Here's what you miss today in the world of tech. Bloomberg says that Microsoft has signed a deal with News Corp's publisher Harper Collins, to use nonfiction books to train an unannounced AI model. Harper Collins says authors can opt out of the scheme, but as you can imagine, this has proven
Starting point is 00:01:18 controversial. In a statement to Bloomberg News, Harper Collins confirmed it reached an agreement with an unidentified AI technology company that would, quote, allow limited use of select nonfiction backlist titles for training AI models to improve model quality and performance. Harper Collins' authors will have the option to participate or not, the company said, quote, part of our role is to present authors with opportunities for their consideration while simultaneously protecting the underlying value of their works and our shared revenue and royalty streams. Harper Collins said, this agreement with its limited scope and clear guardrails around model
Starting point is 00:01:51 output that respects authors' rights, does that, end quote. Technology companies use an array of data from social media sites to news articles to train AI models and companies like Microsoft are hunting for additional sources of high-quality text that they can license to make their programs more accurate, better, able to answer questions or provide expertise on specific subjects. News Corp signed an agreement in May with OpenAI to let the company use content from more than a dozen of its publications, including the Wall Street Journal, Barron's, and MarketWatch. OpenAI has also signed licensing deals with publishers including Alex Springer, the Atlantic, Vox Media, Dot Dash Meredith, Hearst Communications,
Starting point is 00:02:29 and Time magazine. Microsoft has worked on AI initiatives with Reuters, Hurst, and Axel Springer, which publishes Business Insider and Politico, end quote. Now, I became aware of all this because my friend Daniel Kibblesmith, the comic author and Colbert writer and guests on my rad history episode about Calvin and Hobbs, went viral when he posted screenshots from an email from his agent asking if he wanted to participate. Quoting from the A.V. Club, Kibblesmith was apparently offered a non-negotiable $2,500 to allow his book to be bundled
Starting point is 00:03:02 in with other works for training covering a TV. three-year period of use. The posted email which invokes the specter that, quote, these AI models may one day make us all obsolete, also mentions that, quote, several hundred authors have already agreed to the deal and emphasizes the stance that, hey, getting paid to have your work fed into an AI woodchipper is better than having it stolen for that same purpose. Kibblesmith did not agree, including in his post a screenshot of his rejection of the deal, which he called abominable. In a statement to the A.V. Club, Kibblesmith wrote that, quote, It seems like they think they're cooked and they're chasing short money while they can.
Starting point is 00:03:37 I disagree. The fear of robots replacing authors is a false binary. I see it as the beginning of two diverging markets. Readers who want to connect with other humans across time and space, or readers who are satisfied with a customized on-demand content pellet fed to them by the big computer so they never have to be challenged again, end quote. One thing I do want to note is $2,500 for a single book, $2,500 a pop, Might be cheap sounding from the author's perspective, but that is insanely expensive if you're thinking of it from the AI model perspective of trying to get new training data at scale. As Megan Fox said on Blue Sky, it isn't obvious yet, but this is where AI ends. The second you put any non-fixed monetary costs in the data, it costs too much.
Starting point is 00:04:24 Might have worked at smaller models, but at the scale they're at now, LOL, no. even if it's a fixed and not per piece rate now won't stay such, end quote. Meanwhile, Penguin Random House is going the other way. It has amended its copyright notice globally to prohibit the use of books for training AI. The notice will be included in all new titles and reprints. So, I guess, sort of like a robots.com text notice, but analog? Quoting the bookseller. The new wording states, quote, no part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence, technologies, or systems, and will be included in all new titles and any backlist titles that are reprinted. The statement also, quote, expressly reserves the
Starting point is 00:05:06 titles from the text and data mining exception in accordance with a European Parliament directive. The move specifically to ban the use of its titles by AI firms for the development of chatbots and other digital tools comes amid a slew of copyright infringement cases in the U.S. And reports that large tranches of pirated books have already been used by tech companies to train AI tools. In 2024, several academic publishers, including Taylor and Francis, Wiley, and Sage have announced partnerships to license content to AI firms. PRH is believed to be the first of the Big Five Anglophone Trade Publishers to amend its copyright information to reflect the acceleration of AI systems and the alleged reliance by tech companies on using published work to train language models, end quote. Google has shipped the first Android 16 developer preview far ahead of schedule compared to the past decade or so. they're hoping to lower fragmentation by giving OEMs more time to get things together for the new OS.
Starting point is 00:06:06 This is something that I told you about recently, quoting Android Police. Surprise, Google is releasing the first developer preview of Android 16 today so that app developers can test the new APIs and behavior changes. That'll arrive in next year's big update. Android 16 Developer 1 is going live with new features like an embedded photo picker, medical record support, and an updated privacy sandbox. First of all, it might surprise you to hear that Google is releasing Android 16 DP1 today. After all, the first developer preview of Android 15 launched in February of 2024.
Starting point is 00:06:39 February is also when Google released the first developer previews of the last four major Android releases, making November quite early for the first preview of the next major Android release. However, Google announced at the end of last month that its accelerating Android's release schedule. The company confirmed it plans to release Android 16 sometime in Q2 of next year, One leak points towards a June 3rd, 2025 release date for Android 16, but that hasn't been confirmed yet. Android 16, Beta 3 in March 2025 will mark the operating system's platform stability milestone. When Android 16 reaches platform stability, Google assures that subsequent updates to Android 16 won't change any existing APIs, add new APIs, or modify app-facing system behaviors. The platform stability milestone is also when developers will be allowed to release updates to their apps on Google Play to make them target Android.
Starting point is 00:07:28 16. For reference, Android 15 reached its platform stability milestone with its third beta in June 2024, so again, Android 16 will be ready for users and developers much earlier than usual, end quote. Sony has launched cloud streaming on the PlayStation portal in beta, letting PlayStation Plus premium subscribers in some countries stream select PS5 games. Quoting the verge, you'll need to be a PlayStation Plus premium subscriber to take advantage of it. Sony says that to stream at 720p, you'll need a minimum 7mbPS connection, while 1080P quality will require a minimum 13 mbPS connection. Some PlayStation Plus features won't be available to start with cloud streaming to the PlayStation portal, including game trials, party voice chat, game invites for select
Starting point is 00:08:20 games, 3D audio, and in-game commerce, and you won't be able to stream any PS4 games or PS3 games. Child accounts won't also be able to use cloud streaming on the portal, end quote. More signs that crypto is ascendant President-elect Trump has nominated Howard Lutnik, CEO of Canter Fitzgerald, to serve as his Commerce Secretary. The crypto angle here is that Canter Fitzgerald has been a custodian for stablecoin company Tether since 2021, quoting Coin Desk. He will lead our tariff and trade agenda with additional direct responsibility for the Office of the United States Trade Representative Trump said in a statement, Lutnik's firm Cantor Fitzgerald is a stalwart and finance known particularly for its
Starting point is 00:09:07 influential role in the bond market. That includes serving as a primary dealer, an exclusive group of firms allowed to trade directly with the Federal Reserve. It has also dabbled in crypto. Since 2021, it has helped tether manage the giant stockpile of U.S. Treasuries that back its USDT stablecoin. Canter Fitzgerald also recently announced a Bitcoin financing business aimed at providing leverage to Bitcoin investors with $2 billion of initial funding. I am a fan of crypto, but let me be very specific, Bitcoin, just Bitcoin. These other coins, they are just not a thing, he told CNBC in a podcast last year, adding, I'm a big fan of this stable coin called Tether. Lutnik has argued Bitcoin should be treated as a
Starting point is 00:09:47 commodity, meaning it would face less regulation than, say, stocks or bonds, and is superior to other cryptocurrencies given its lack of central authority and censorship-resistant properties, end quote. update on Blue Sky, which says it now has more than 20 million users after hitting 15 million users just on November 13th, and the app has been number one in the U.S. App Store since November 13th per app figures, quoting TechCrunch. What's more, new data indicates the app's rapid growth is seeing it close the gap with another prominent ex-rival Instagram threads across metrics like daily active users and website visits. Blue Sky's user base is still much smaller than threads, which recently reported north of 260. 75 million monthly active users. However, if Blue Sky's current rate of growth holds up, it could catch up with threads in time, market intelligence firm Similar Web believes. Its data indicates that
Starting point is 00:10:43 threads had five times more daily active users than Blue Sky ahead of the U.S. elections, but on November 15th, a peak day of activity for Blue Sky, threads lead over Blue Sky had been reduced to just 1.5x in the U.S. Daily active users include the mobile apps on iOS and Android, not website visitors. Instagram head Adam Mosseri denied similar webs data is accurate, but meta does not share DAWS. The firm's data also indicates that Blue Sky overtook threads in daily website visits in both the U.S. and the UK, which signals strong interest from potential new users. Globally, daily website visits on Blue Sky haven't yet surpassed threads, but it's come very close as of mid-November. The Musk-owned App X remains dominant, though, with a U.S. daily active user count that's more than 10 times larger than Blue Sky at present.
Starting point is 00:11:30 end quote. Finally today, you know how various governments around the world are worried about app tracking and specifically TikTok tracking people? The first bans announced are always, oh, government officials and military personnel can't have TikTok on their personal devices. The Chinese government has made similar bans for their government workers, but in their case, banning Western apps. This is probably why. Wired did an investigation using U.S. data broker data stream and found they could easily track the movements of U.S. military and intelligence workers in Germany. Quoting Wired, a joint investigation by Wired, Bayes-Rikerrund Funk, and NetsPolitic.org, reveal that U.S. companies legally collecting digital advertising data are also providing the world a cheap
Starting point is 00:12:22 and reliable way to track the movements of American military and intelligence personnel overseas from their homes and their children's schools to hardened aircraft shelters within an airbase where U.S. nuclear weapons are believed to be stored. A collaborative analysis of billions of locations obtained from a U.S. data broker provides extraordinary insight into the daily routines of U.S. service members. The findings also provide a vivid example of the significant risks. The unregulated sale of mobile location data poses to the integrity of the U.S. military and the safety of its service members and their families overseas. We tracked hundreds of thousands of signals from devices inside sensitive U.S. installations in Germany. That includes scores of devices of devices,
Starting point is 00:13:01 within suspected NSA monitoring or signals analysis facilities, more than a thousand devices at a sprawling U.S. compound where Ukrainian troops were being trained in 2023 and nearly 2,000 others at an Air Force base that has crucially supported American drone operations. A device likely tied to an NSA or intelligence employee broadcast coordinates from inside a windowless building with a metal exterior known as the tin can, which is reportedly used for NSA surveillance, according to agency documents leaked by Edward Snowden. Another device transmitted signals from within a restricted weapons testing facility revealing its zigzagging movements across a high-security zone
Starting point is 00:13:40 used for tank maneuvers and live munitions drills. We traced these devices from barracks to work buildings, Italian restaurants, Aldi grocery stores, and bars. As many as four devices that regularly pinged from Ramstein Air Base were later tracked to nearby brothels off-base, including a multi-story facility called Sex World. Experts caution that foreign governments could use this data to identify individuals with access to sensitive areas. Terrorists or criminals could decipher when U.S. nuclear weapons are least guarded.
Starting point is 00:14:11 Or spies and other nefarious actors could leverage embarrassing information for blackmail. The unregulated data broker industry poses a clear threat to national security, says Ron Wyden, a U.S. senator from Oregon with more than 20 years overseeing intelligence work. It is outrageous that American data brokers are selling location data collected from thousands of brave members of the armed forces, who serve in harm's way around the world, end quote. To give you an example of what they're talking about, let me quote from the opening graphs of the story. Nearly every weekday morning, a device leaves a two-story home near Weizbaden, Germany, and makes a 15-minute commute along a major Autobahn.
Starting point is 00:14:48 By around 7 a.m., it arrives at Lucius D. Clay Kazner, the U.S. Army's European headquarters, and a key hub for U.S. intelligence operations. The device stops near a restaurant before heading to an office near the base that belongs to a major government contractor responsible for outfitting and securing some of the nation's most sensitive facilities. For roughly two months in 2023, this device followed a predictable routine, stops at the contractor's office, visits to a discreet hangar on base, and lunchtime trips to the base's dining facility. Twice in November of last year, it made a 30-minute drive to the Dagger Complex, a former intelligence and NSA signals processing facility. on weekends, the device could be traced to restaurants and shops in Weisbaden. The individual carrying this device likely isn't a spy or high-ranking intelligence official. Instead, experts believe they're a contractor who works on critical systems.
Starting point is 00:15:36 HVAC, computing infrastructure, or possibly securing the newly built consolidated intelligence center, a state-of-the-art facility suspected to be used by the national security agency. Whoever they are, the device they're carrying with them everywhere is putting U.S. national security at risk. an internal Pentagon presentation obtained by the reporting collective claims that not only is the domestic data collection likely capable of revealing military secrets, it is essentially unavoidable at the personal level, service members' lives being simply too intertwined with the technology permitting it. This conclusion closely mirrors the observations of Chief Justice John Roberts of the U.S. Supreme Court, who in landmark privacy cases within the past decade, described cell phones as being a, quote, pervasive and insistent part of daily life, and that owning one was, quote, indispensable to participation in modern society, end quote.
Starting point is 00:16:26 Let me come back to the concern for potential blackmail for a second. Things like, you know, a foreign agent being like, do you want your spouse to know you went to a place called Sex World? If not start working with me. But also, on a basic level, location data can piece a lot of secrets together on their own, quoting one last time. When it comes to jeopardizing national security, even data on low-level personnel can pose a risk, says Vivek Chulukuri at CNS.
Starting point is 00:16:57 Low-level targets can lead to high-level compromises, Chilukuri says. Even if someone isn't senior in an organization, they may have access to highly sensitive infrastructure. A system is only as secure as its weakest link. He points out that if adversaries can target someone with access to a crucial server or database, they could exploit that vulnerability to cause serious damage. It just takes one USB stick plugged into the right device to comment. compromise an organization, he said. It's not just individual service members who are at risk. Entire security protocols and operational routines can be exposed through location data.
Starting point is 00:17:29 At Bucal Air Base, where the U.S. is believed to have stored an estimated 10 to 15 B-61 nuclear weapons, the data reveals the daily activity patterns of devices on the base, including when personnel are most active and, more concerningly, potentially when the base is least populated, end quote. Everyone head over to the rad history feed because this week's episode is about the Challenger disaster. My guest is the great Farhad Manjou. It's very interesting. It was something that I remember, but I didn't know all the details of. Check that out. Talk to you tomorrow.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.