The Journal. - How DeepSeek Sank The Stock Market

Episode Date: January 29, 2025

Last week, the Chinese company DeepSeek debuted a new AI model -- and overturned years of conventional wisdom about what it takes to build great AI. The shock unleashed a $1 trillion bloodbath on Wall... Street. WSJ’s Stu Woo and WSJ’s Gunjan Banerji unpack DeepSeek's achievement and the market chaos it unleashed. Further Reading: -How China’s DeepSeek Outsmarted America  -The Day DeepSeek Turned Tech and Wall Street Upside Down  Further Listening: -The Company Behind Chat GPT  -The Hidden Workforce That Helped Filter Violence and Abuse Out of ChatGPT  Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 So what was one of your first clues that Monday was going to be an interesting day? Yeah, it's funny. I was actually commuting in from Long Island on Monday morning. I was on the Long Island railroad and I looked at my phone, I was checking my emails, obviously checking the markets. That's my colleague Gunjan Banerjee. She hosts WSJ's Take On The Week podcast and has covered markets and investing for the journal for years.
Starting point is 00:00:31 And I see that futures contracts tied to the NASDAQ composite were down more than 4%, which was really an eye-popping move. We had not seen a move of that magnitude in quite some time. Something was going down in the stock market. And by the time Gunjian made it to the office, things had gotten even wilder. I get to my desk and, you know, around the time the market opens at 9.30,
Starting point is 00:01:01 I think everyone is kind of glued to their screens at that point, and they see that this really ugly day for the stock market is beginning. All three major indexes are down a bunch. NVIDIA, of course, is down double digits. NVIDIA, the AI chipmaker. Its stock was tanking. We could be looking at the biggest drop in market cap on record here for NVIDIA when
Starting point is 00:01:23 you take a look at the right across the screen, specifically the NASDAQ 100 features and the Intensys. So NVIDIA was down more than 10% shortly after the opening bell. It ended the day down 17%. And just to put that into context, that is a market value loss of almost $600 billion. So much money. Right, that is just an insane amount of wealth and amount of value that evaporated within hours. In fact, it is the biggest one day market value drop
Starting point is 00:01:58 on record. By the end of Monday, about a trillion dollars of value had been wiped from the stock market. But while the drop was historic, Gunjian also had a pretty good idea of why it was happening. So what traders, people on Wall Street, Silicon Valley was pointing to was this upstart artificial intelligence company DeepSeek. DeepSeek. It's an AI company out of China. And over the last few days, its chatbot has been blowing people away.
Starting point is 00:02:32 Experts say DeepSeek's AI is just as capable, maybe even more capable, than leading AI chatbots like ChatGPT, but its creators claim it was made for much less money. And that set off a major shakeup in Silicon Valley and on Wall Street. DeepSeek, this new artificial intelligence competitor, forced everyone to take a look at their portfolios, take a look at their AI products, and really rethink who the winners and losers of this artificial intelligence trade were
Starting point is 00:03:05 going to be. All of a sudden investors were going, hey are the stocks that we own? Is Nvidia, is it worth what we think it's worth? Welcome to The Journal, our show about money, business, and power. I'm Jessica Mendoza. It's Wednesday, January 29th. Coming up on the show, how DeepSeek sank the stock market. Calling all sellers. Salesforce is hiring account executives to join us on the cutting edge of technology.
Starting point is 00:03:51 Here, innovation isn't a buzzword. It's a way of life. You'll be solving customer challenges faster with agents, winning with purpose, and showing the world what AI was meant to be. Let's create the agent-first future together. Head to salesforce.com slash careers to learn more. So what is DeepSeek? DeepSeek is an AI chatbot.
Starting point is 00:04:23 It's if you've tried chat GPT, it's just like that. You go to the website, you log in, and you ask it a question, and it'll give you an answer as if a pretty smart human were answering it. That's my colleague, Stu Wu. He covers tech in Asia. And how did you first hear about DeepSeek? I was doing this video interview with somebody in San Francisco who was the founder of an AI company.
Starting point is 00:04:46 And we were talking about something else and he didn't know something. So he shared his screen with me and said, let me look it up. And what I thought was weird was that he didn't go to Google or chat GPT. He went to something I never heard of, DeepSeek. And what he said was that he'd been playing with it for the past couple days and he and his coworkers were just talking about it, about how it was amazing and probably just as good as all the American competitors that he's been looking at.
Starting point is 00:05:13 So tell us a little bit about the company itself and the AI model. Who made it? So it's the brainchild of a Chinese guy named Liang Wenfeng. He co-founded this hedge fund in China. It's based in Hangzhou, which is also the same tech hub where the Chinese company Alibaba is based. DeepSeek grew out of that. Liang is a pretty smart guy. He studied AI at one of China's top engineering programs. What I thought was really interesting about the company was that it had this really unusual hiring practice.
Starting point is 00:05:46 Liang wants creative people, but he doesn't really care that much about experience. And he says his hiring principle is hire people with the least amount of experience because his idea is that if you ask someone with work experience to solve a problem, they're going to say, well, we should solve it like this because this is how I've done it in the past. But if you ask people with that experience to solve that same problem, they'll have to sit down, think about the problem, and then they'll figure out the best and freshest and most efficient way to do it. So that's why a lot of people who work at DeepSeek are either fresh graduates or people with just a year or two of work experience. And so this sort of takes us to, you know, what the AI model, that approach kind of created.
Starting point is 00:06:34 So before DeepSeq, what was sort of the going assumption about how you make a cutting edge AI model? Yeah. So the conventional thinking was that if you wanted to make a world-class AI chatbot or AI system, you needed a lot of the world's best AI chips that are super expensive as well. In the US, AI development has been dominated by a handful of big tech companies who've trained their AI models using tons of top-line AI chips.
Starting point is 00:07:06 Those chips are largely made by, you guessed it, Nvidia. The assumption was, if you didn't have enough of the right kind of chips, you couldn't build a world-class AI model. And the other assumption was that a Chinese company could never do that because the US government had set these restrictions on what kind of chips US companies could sell to China. The thinking was that China would never catch up. So, let's take a look at those assumptions. The first of those assumptions is that, like you said, you need a lot of chips to create these high-powered AI models. How did Deep Seek sort of undermine that assumption? Yeah. So, Deep Seek released this research paper and explained
Starting point is 00:07:45 how it did what it did. And it said that it spent a fraction of the money developing its advanced chatbot and it did so using less advanced chips. So how can we understand that? So I think a good analogy is that let's look at the the first chat GPT that many of us have used. And let's try to understand how that was trained. So imagine that chat GPT is like a librarian that's read all the books in the library. And when you ask it a question, it'll give you an answer because it's read that book.
Starting point is 00:08:17 But the problem is that to read all those books, that requires a lot of time and a lot of electricity for those computer chips to read those books. So DeepSeek didn't have those resources. So it tried a new approach. So imagine you're still in the library and DeepSeek is a librarian, but it hasn't read all those books. What it does instead is that it's focused on being really good at figuring out what
Starting point is 00:08:39 book has the answer after you ask it the question. And it turns out that's just as effective as what Chat GPT originally did. It was just as good, but it used a fraction of the resources. It makes me think a little bit about kind of expert versus journalist in some ways. It's like what we do is we know who to ask and what questions to ask instead of actually like getting the PhD.
Starting point is 00:09:04 We go to the experts ourselves versus the expert who has to like learn everything about that subject. Yeah, you know, that's a good example. There's very few of us who can just read all those books and just maintain all that information in their head. And then when we have to figure out, we just kind of like stress out and call everybody we know and try to answer that question within an hour. Exactly.
Starting point is 00:09:22 But DeepSeek does that in just a few seconds. And then the second assumption here is that a Chinese company couldn't do this because they wouldn't have access to the best chips, to NVIDIA's chips. First of all, when and why did the U.S. start restricting the export of AI chips to China? So the thinking during the Biden administration was that AI is going to be really important for military purposes. So just imagine you can use it for developing a nuclear weapon or a biological weapon or helping a general make a decision on the battlefield. It could give one side an absolute advantage.
Starting point is 00:10:01 So that's why they decided we got to stay a couple of years ahead of China on AI. We can't lose an edge with AI on the battlefield. In 2022, the Biden administration put restrictions on the kinds of chips US companies could sell to China. So what they said was that if you're a US company that wants to sell these chips to China, you have to restrict this parameter called interconnect bandwidth. And the analogy that I would use is that if you were designing a race car,
Starting point is 00:10:26 this restriction would constrict how much gasoline ran through the fuel line. NVIDIA followed that rule, but it also figured out a workaround for its Chinese chips. It complied with that fuel line. The fuel line was constricted, but it increased performance in other parts of the car engine to compensate for that, to make the most out of the fuel it did have. The result was that the chips Nvidia was selling in China were more powerful than the U.S.
Starting point is 00:10:54 government would have preferred. The Biden administration eventually cut off that workaround, but it took about a year. So that gave DeepSeek and other companies a year to buy these pretty powerful chips. And if you look at one of DeepSeek's research papers, it said it used about 2,000 of these powerful China-only chips from NVIDIA to train one of its advanced AI models. Last week, DeepSeek released its most advanced AI model yet, called R1. And what has the reaction been? Well, I can't remember anything quite like this. I mean, I think the closest thing is when chat GPT came out three years ago, and that's like kind of like change the world. Everybody's trying to write poems on it, you know.
Starting point is 00:11:35 Immediately. And, but this had some serious financial consequences, right? That financial fallout is after the break. What's up Spotify? This is Holly. I remember this one time we were on tour. We didn't have any guitar picks and we didn't have time to go to the store. So we placed an order on Prime and it got there the next day, ready for the show. Whatever you're into, it's on Prime. For the past few years, before DeepSeek crashed onto the scene, investors had been piling into AI stocks, betting on big returns. They called it the AI trade.
Starting point is 00:12:21 Here's Gunjian Banerjee again. Basically, investors had latched onto this idea that artificial intelligence was going to unleash this wave of productivity in the economy among US workers and lead to gobs and gobs of profits for a handful of big technology companies, including Nvidia. At its most recent peak,
Starting point is 00:12:44 the company was worth more than $3 trillion. And Nvidia wasn't the only company people were betting on. Who else did people think were the winners of AI? Really the big technology stocks. Even think of like Meta, Microsoft, which also has a competitor to ChatGPT. People were thinking of some of these huge technology companies in the US as the key winners from the AI boom. And when we're talking about sort of people piling on,
Starting point is 00:13:14 how big did this get? The AI trade completely ate the stock market. It just took over almost every corner of financial markets that you can imagine. Like energy stocks. People bought up shares of energy and utility companies because training AI models uses a lot of power. Did it feel like a bubble? It's interesting. There has been no shortage of investors the past few years saying, we think this artificial intelligence trade is a bubble. And one of the reasons for that is just the amount of exuberance
Starting point is 00:13:51 we've seen surrounding this trade and the levels of speculation. There was a lot of yoloing out there. You know, you only live once, let's go for it. This was another flavor of like, let's get really, really rich from trading these AI stocks, let's pile into their options, which are super risky and can provide kind of these boomer bus returns. Let's pile into really risky exchange traded products. So there was just this mountain of speculation building and building and building while the AI craze continued.
Starting point is 00:14:28 But it kept growing. It kept growing and growing and growing. It kept snowballing. And then came DeepSeek, a cutting edge AI product that wasn't built by a U.S. tech giant and seemingly didn't require a ton of chips. Investors who thought they knew who the winners of AI were suddenly weren't so sure. I think the DeepSeek news really spooked a lot of people about the valuations that they were assigning
Starting point is 00:14:57 to some of these technology giants. It was a moment that made people question where they had been putting their money the past few years. And why were investors backing away from Nvidia specifically? The one thing that a lot of investors were fixated on is that it seemed like DeepSeek needed a lot less computing power. So that would mean that the AI models of the future might not require as many high-end NVIDIA chips as investors have been counting on.
Starting point is 00:15:30 I mean, the way one investor put it to me was, we've been banking on NVIDIA being the disruptor. Are they being disrupted now? In a statement Monday, NVIDIA praised DeepSeek's advancements. It added that serving up these kinds of AI models to users requires large numbers of its chips. Since Monday's chaos, the market seems to have stabilized. Tech stocks rebounded on Tuesday, with NVIDIA up 9%.
Starting point is 00:15:58 But my colleague Stu Wu says that the AI industry is just beginning to wrestle with DeepSeq's model and its implications. For example, there's still a lot of questions about how DeepSeq pulled this off. So DeepSeq published some research papers that explained how it accomplished, what it accomplished, but it hasn't revealed all of its secrets. So we don't know exactly what the training data it used. We don't know what that looks like. And there's a lot of people in Silicon Valley
Starting point is 00:16:27 who are wondering out loud, without evidence, I might add, but this is informed speculation that maybe DeepSeq actually had even more powerful NVIDIA chips than it's letting on. So there's still a lot to figure out. DeepSeq disclosed some of its secrets, but not all of them. OpenAI, the maker of ChatGPT, has set its looking into whether DeepSeek used large volumes of OpenAI data to help develop its model.
Starting point is 00:16:52 DeepSeek didn't immediately respond to requests for comment. I'm curious, I mean, were all of those people also as surprised as the rest of us about this? I'm just trying to figure out how did everyone, Silicon Valley, those people in Wall Street, how did everyone miss this? Yeah, so how did everybody miss this? Okay, that's a good question. So after DeepSeat came out last week, a lot of prominent people in Silicon Valley, whether they're AI researchers or venture capitalists, went on X or some other platform and said this is really innovative, right? Like they just found a whether they're AI researchers or venture capitalists,
Starting point is 00:17:25 went on X or some other platform and said, this is really innovative. They just found a new way of doing this. And one of the guesses was Well, they looked at undervalued strategies in baseball and they figured out how to win despite this handicap. So that's one theory that resource constraints breeds creativity. Do you think part of the problem here was that people underestimated China? I've been thinking a lot about this question. So I think a lot of people are in general surprised at how far China has come in technology.
Starting point is 00:18:06 But in America, you don't actually get to see a lot of this because of effective bans on Chinese technology in America. A lot of Americans have never touched a Chinese cell phone made by Huawei or electric car made by BYD, which is one of the world's biggest car companies, right? These things basically don't exist in America. So I think what happened was that when Deep Sea came out, anybody could download it, ask it a question in English, and see the answer in English. And they're like, wow, January 29th.
Starting point is 00:18:50 The Journal is a co-production of Spotify and The Wall Street Journal. Additional reporting in this episode by Asa Fitch, Rafael Huang, Karen Langley, and Sam Schechner. Thanks for listening. See you tomorrow.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.