Tech Brew Ride Home - Fri. 07/28 – Is The Clock Ticking For Threads?

Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to the time. Tech me right home for Friday, July 28th, 2020. I'm Brian McCullough today. The whole Threads saga has been a whirlwind. At the beginning of the month, I asked if Threads had already won. And now at the end of the month,

Starting point is 00:00:45 I'm wondering if the clock is ticking in terms of their chances of survival. Generative AI. But for robots, again, real robots. Are VCs pulling back from the crypto space and, of course, the weekend long-read suggestions. Here's what you missed today in the world of tech. How's Threads doing? Part 11? Yeah, it seems even Zuck is aware. that it remains undercooked, and if it remains that way too much longer, it's going to be a problem increasingly. At an internal town hall, meta-exexed said meta is looking to add more retention driving hooks to threads, like showing Instagram users important threads. Though, again, desktop, DMs, anything, hashtags? Maybe that sooner rather than later, quoting Reuters.

Starting point is 00:01:34 Meta Platforms executives are heavily focused on boosting retention for their new Twitter rival threads. After the app lost more than half of its users in the weeks following its buzzy launch, CEO Mark Zuckerberg told employees on Thursday. Retention of users on the text base app was better than executives had expected, though it was not perfect, said Zuckerberg, speaking at an internal company town hall, the audio of which was heard by Reuters. Obviously, if you have more than 100 million people sign up, ideally it would be awesome if all of them or even half of them stuck around were not there yet, he said. Zuckerberg said he considered the drop-off normal and expected retention to grow as the company adds more features to the app,

Starting point is 00:02:12 including a desktop version and search functionality. Meta is looking at adding more retention driving hooks to entice users to return to the app, like, quote, making sure people who are on the Instagram app can see important threads, said chief product officer Chris Cox, end quote. Jackbots are cool, image generating AI is cool, but what about an LLM that enables robots to exist in the real world, in real meatspace. Google has launched RT2 or Robotics Transformer 2, a, quote, vision-language action model trained on text and images from the web that can output robotic actions again in the real world.

Starting point is 00:03:00 Quoting the New York Times. A quiet revolution is underway in robotics, one that piggybacks on recent advances in so-called large language models, the same type of artificial intelligence system that powers chat GPT, bard, and other chatbots. Google has recently begun plugging state-of-the-art language models into its robots, giving them the equivalent of artificial brains. The secretive project has made the robots far smarter and given them new powers of understanding and problem-solving. I got a glimpse of that progress during a private demonstration of Google's latest robotics model

Starting point is 00:03:30 called RT2. The model, which is being unveiled on Friday, amounts to a first step toward what Google executives described as a major leap in the way robots are built and programmed. A one-armed robot stood in the front of a table. On the table sat three plastic figurines, a lion, a whale, and a dinosaur. An engineer gave the robot an instruction, pick up the extinct animal. The robot word for a moment, then its arm extended and its claw opened and descended. It grabbed the dinosaur. Until very recently, this demonstration, which I witnessed during a podcast interview at Google's Robotics Division in Mountain View, California last week, would have been impossible. Robots weren't able to reliably manipulate objects they had never seen before, and they certainly weren't capable of making the logical leap from extinct animal to plastic dinosaur.

Starting point is 00:04:18 To understand the magnitude of this, it helps to know a little bit about how robots have conventionally been built. For years, the way engineers at Google and other companies trained robots to do a mechanical task, say flipping a burger, for example, was by programming them with a specific list of instructions. Lower the spatula, 6.5 inches, slide it forward until it encounters resistance, raise it 4.2 inches, rotate it 180 degrees, and so on. Robots would then practice the task again and again, with engineers tweaking the instructions each time until they got it right. This approach worked for certain limited uses, but training robots this way is slow and labor intensive. It requires collecting lots of data from real-world tests. And if you wanted to teach a robot to do

Starting point is 00:04:58 something new, to flip a pancake instead of a burger, say, you usually had to reprogram it from scratch. In recent years, researchers at Google had an idea. What if, instead of being programmed for specific tasks one by one, robots could use an AI language model, one that had been trained on vast swaths of internet texts to learn new skills for themselves. Google's new robotics model, RT2, can do just that. It's what the company calls a vision language action model or an AI system that has the ability, not just to see and analyze the world around it, but to tell a robot how to move. It does so by translating the robot's movements into a series of numbers, a process called tokenizing and incorporating these tokens into the same training data as the language model.

Starting point is 00:05:40 Eventually, just as chatyBT or Bard learns how to guess what words should come next in a poem or a history essay, RT2 can learn to guess how a robot's arm should move to pick up a ball or throw. an empty soda can into the recycling bin. In other words, this model can learn to speak robot, Mr. Hausman said, end quote. This is either extremely impressive and exciting or extremely worrying. I can never tell anymore. Apple plans to require developers to submit reasons to use certain APIs going forward in their apps, beginning this fall, apparently to prevent the misuse of those APIs for fingerprinting purposes.

Starting point is 00:06:24 Quoting 9 to 5 Mac. As detailed on the Apple developer website, some APIs are now classified as required reason APIs. This means that in order to use them in an app, the developer must describe to Apple the purpose of that API in the app. The company explains that the measure aims to crack down on fingerprinting a technique for tracking users across different apps and websites. Starting this fall with the release of iOS 17, TVOS 17, WatchOS 10, and MacOS Sonoma to the public, developers will be notified about submitting apps using a required reason reason. API without describing the reasons for using it. From Spring 2024, apps that use these APIs without a valid reason will be rejected. To prevent the misuse of certain APIs that can be used

Starting point is 00:07:05 to collect data about users' devices through fingerprinting, you'll need to declare the reasons for using these APIs in your app's privacy manifest. This will help ensure that apps only use these APIs for their intended purpose, Apple explains. While this measure was created with privacy in mind, some developers told 9 to 5 Mac they're concerned about app and up date rejection rates increasing further. For example, Apple says that user defaults is one of the required reason APIs. For those unfamiliar, this is a basic and fairly common API that stores user preferences for an app, which means lots of apps use it. This can result in developers having their apps rejected simply for forgetting to add an explanation for using the API. At the same

Starting point is 00:07:45 time, it's hard to imagine how Apple will control the use of this API, since most developers can simply say they're storing user preferences with it. Apple will let developers appeal a rejection and submit a request to approve a situation that is not covered in the current guidelines. More details can be found on the Apple developer website, end quote. Sources say Sequoia cut the size of its cryptocurrency fund to $200 million from $585 million. They also cut their ecosystem fund, which invests in other funds, to $450 million from $900 million, quoting the Wall Street Journal. Sequoia told fund investors, in March it made the decision to reduce the funds

Starting point is 00:08:29 to better reflect the changed market. The cryptocurrency fund, for example, will focus more on backing young startups after an industry crash wiped out opportunities to back larger companies. By pairing back the fund sizes, Sequoia is lowering the amount of committed capital required from investors, known as limited partners who are already seeing lower returns from venture funds and are bracing for further markdowns. The changes show the difficult cuts venture firms are making during one of the roughest years and recent memory for the startup industry. They are trying to undo the breakneck expansion and liberal spending that characterized a historic startup boom, which no longer makes sense as dealmaking slows and funds struggle to raise more cash.

Starting point is 00:09:06 Sequoia announced the two funds in February 2022 as part of an ambitious firm restructuring after spending months ramping up its investments in crypto. The crypto crash has since wiped out revenue for many blockchain startups, end quote. It is part of my job here to Trendspot for you all. And look, Sequoia is going through major, major generational transformation of a It's very structure at this moment, so take this as a grain of salt anic data, but it is interesting to note that the area they want to cut back on first is crypto. Oh, and fund of funds that support emerging fund managers like myself, too. But also, I'm going to keep an eye open for how

Starting point is 00:09:44 investment money is or maybe is not continuing to flow into the crypto space. Time for the weekend long read suggestions. And first up, this week I stumbled across the best explainer for lay people I've found yet, an explainer of how large language models actually work. Like, seriously, if you are non-technical in the space and want the clearest walkthrough of how this works conceptually, try this piece from understanding AI. Quote, the above diagram depicts a purely hypothetical LLM, so don't take the details too seriously. We'll take a look at research into real language models shortly. Real LLMs tend to have a lot more than two layers. The most powerful version of GPT3, for example, has 96 layers. Research suggests that the first few layers focus on understanding

Starting point is 00:10:36 the syntax of the sentence and resolving ambiguities like we've shown above. Later layers, which we're not showing to keep the diagram a manageable size, work to develop a high-level understanding of the passage as a whole. For example, as an LLM reads through a short story, it appears to keep track of a variety of information about this story's characters. sex and age, relationships with other characters, past and current location, personalities and goals, and so forth. Researchers don't understand exactly how LLMs keep track of this information, but logically speaking, the model must be doing it by modifying the hidden state vectors as they get passed from one layer to the next. It helps that in modern LLMs, these vectors are extremely

Starting point is 00:11:17 large. For example, the most powerful version of GPT3 uses word vectors with 12,288 dimensions. That is, each word is represented by a list of 12,288 numbers. That's 20 times larger than Google's 2013 Word2-Vec scheme. You can think of all of these extra dimensions as a kind of scratch space that GPT3 can use to write notes to itself about the context of each word. Notes made by earlier layers can be read and modified by later layers, allowing the model to gradually sharpen its understanding of the passage as a whole. So suppose we changed our diagram above to depict a 96-layer language model interpreting a thousand-word story. The 60th layer might include a vector for John with a parenthetical comment like main character, male, married to Cheryl, cousin of Donald from Minnesota, currently in

Starting point is 00:12:05 Boise, trying to find his missing wallet. Again, all of these facts and probably a lot more, would somehow be encoded as a list of 12,288 numbers corresponding to the word John. Or perhaps some of this information might be encoded in the 12,288 dimensional vectors for Cheryl Donald Boise Wallet or other words in the story. The goal is for the 96th and final layer of the network to output a hidden state for the final word that includes all of the information necessary to predict the next word, end quote. Then Fast Company has a profile of my friends over at the Acquired podcast. You've heard the Acquired guys. They've been on this show for our World Cup of Entrepreneurs bonus episodes. I've been on their show a couple times. But here's a great behind the scenes of

Starting point is 00:12:52 how and why their pod has gotten so popular. Quote, the research effort to produce an episode of Acquired wasn't always this Herculian. In January 2020, the duo told the Divinations newsletter writer Nathan Batchez that prep typically requires five to ten hours per episode. Two and a half years later, investor and podcaster Logan Bartlett asked about what goes into an episode, and Gilbert and Rosenthal replied that they cumulatively spend a hundred hours. Now, a year later, the host seemed almost consumed by the process. process, logging 60 hours a week for a month. The night before recording the Nike episode,

Starting point is 00:13:28 Rosenthal, says, my wife, Jenny, was like, I get my husband back tomorrow night. Gilbert chimes in, yeah, my wife feels that way, too. When asked if there's an upper limit to the amount of work that goes into an episode, Rosenthal admits, I've literally had this discussion over the past month with my therapist. Let's pause the flag that we haven't yet mentioned Rosenthal's 39-page Nike script that he produced from all that research, or Gilbert's list of insights. He's curated to bring up during taping that runs almost 4,000 words. They both stand during what can now be an eight-hour taping session with Gilbert wearing his cushiest Nike Invincible Two running shoes, each of them powering through on an occasional bite of an energy bar, a kind five grams of sugar

Starting point is 00:14:08 bar for Gilbert, a Met RX, 100 for Rosenthal. An editor helps whittle down the recording to about four and a half hours. Then Gilbert's self-described persnicketyness, along with some bigger picture suggestions from Rosenthal, further refine the episode to four hours, three minutes, 28 seconds, end quote. Finally, from rest of world, a profile of the vanishing world of the internet cafe. History will look back at internet cafes as a key vector for giving millions of people around the world their first access to the online world. But now that digital data is ubiquitous, even in the most remote corners of the world, do internet cafes still have a place? quote, Derek Buchena, now nearly 40 years old, has spent most of the last two decades around computers.

Starting point is 00:14:57 In the early 2000s, computers were very rare in Uganda, as were opportunities to learn how to use them. But the church Buchania attended announced free computer classes in partnership with Zion Internet Cafe, one of the first and most prominent internet cafes in Uganda, located in Kampala's central business district. In 2014, Buchania launched his own internet cafe, BK Internet Cafe, in a low-income Kampala suburb. At the time, the Uganda Communications Commission estimated that Uganda had about 10 million internet users representing roughly a third of the country's population. When I launched my cafe, the business was booming, Buchania said. He launched four more cafes in the neighborhood and remembers them being packed with young people. Facebook was the go-to social network. Young people could spend

Starting point is 00:15:39 hours just chatting and catching up with friends, he said. Then also Google had taken a foothold, so our customers were just searching for anything. YouTube was prohibited because it would consume a lot of data and the internet speeds weren't great anyway, end quote. Bukenia said the industry shifted in 2016. Cheap Chinese smartphones suddenly became widely available across Uganda. All of the sudden, one could get a good smartphone for less than 500,000 shillings or 170, he said. In their wake, telecom providers launched cheap internet and data bundles. Over the next three years, internet cafes in the country started to close down. By the time COVID-19 came around, I was the only internet cafe left standing, said Buchania, who has since diversified his revenue. He now makes the most money

Starting point is 00:16:20 selling movies and offering printing, photocopying, and scanning services. Buchania admits the internet part of his cafe no longer attracts users. Some days, no one uses the computers at all. I don't think it makes sense to have a business that serves very few users, he said. Printing, scanning, and photocopying documents sustain us for now, but I am thinking of pivoting to a co-working space to tap into the remote workers, end quote. No week on bonus episodes for you this weekend, but I think we might have one next week. Talk to you on Monday.

Tech Brew Ride Home - Fri. 07/28 – Is The Clock Ticking For Threads?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.