Hard Fork - How Based is Grok 3? + Robinhood CEO Vlad Tenev on Markets For Everything + Vibecoding 101

Episode Date: February 21, 2025

This week, Elon Musk brought a new chatbot into the crowded A.I. universe — Grok 3, the latest model from his company xAI. We break down how it compares with other leading models and what it reveals... about Musk’s larger ambitions. Then, Vlad Tenev, the chief executive of the investing platform Robinhood, lays out his vision for the future of investing and fields some difficult questions about his company’s role in fueling a culture of risky financial speculation. Finally, Kevin revisits his high school coding era and tries to make Casey a new software tool, with an A.I. assist. Guest:Vlad Tenev, chief executive of Robinhood and co-founder of Harmonic. Additional Reading:There Are Probably Too Many A.I. Companies NowAn Investing Revolution Is Coming. The U.S. Isn’t Ready for It.Is Math the Path to Chatbots That Don’t Make Stuff Up? We want to hear from you. Email us at hardfork@nytimes.com. Find “Hard Fork” on YouTube and TikTok. Unlock full access to New York Times podcasts and explore everything from politics to pop culture. Subscribe today at nytimes.com/podcasts or on Apple Podcasts and Spotify.

Transcript
Discussion (0)
Starting point is 00:00:00 Oh, I am having quite a morning, Casey. So I was on the escalator, I was on the train today. I got off, I went up the escalator at Embarcadero. You know, it's a very long escalator. It is. And, you know, very crowded rush hour and someone bumped into me and knocked my phone out of my hand onto the platform below. And so I thought, okay, well, I'm midway up the escalator.
Starting point is 00:00:26 Wait, so how far did it drop? Probably 15 feet. Okay. Like a significant drop. Big drop. And I thought to myself, if I get to the end of this escalator and come back down, it's going to be too late.
Starting point is 00:00:37 Someone will have snatched it or accidentally kicked it onto the tracks like my phone has gone. Wait, you do not have much faith in the citizens of San Francisco. Have you visited San Francisco? Yes, I think a phone could generally survive 30 seconds on the ground, but I guess we'll find out what happens.
Starting point is 00:00:49 Anyway, well, I had severe separation anxiety in the split second before I decided to do what I did, which was to try to run down the crowded up escalator. So I became that guy who was like pushing through the commuters, being like, I'm sorry, I'm sorry. And it took forever because the escalator was moving in the opposite direction. So I started my morning by alienating
Starting point is 00:01:14 and possibly injuring some people on my way down to retrieve my phone. And I would just like to formally apologize to everyone at the Embarcadero subway stop between 8.15 and 8.30 this morning. You were like sort of a character in a bad comedy, running down the escalator. You know, I heard, I was at that platform this morning
Starting point is 00:01:33 and I heard a woman screaming, but now I'm realizing that was you. Did you get the phone? I did, it's safe, no cracks, it was retrieved, but yeah, that was a wild way to start my day. Well, thank you to all the good Samaritans of San Francisco who did not steal Kevin's phone during the 30 seconds when it was on the floor.
Starting point is 00:01:50 That, it kind of restores your faith in humanity a bit. Oh, it does. I'm Kevin Ruse, a tech columnist at the New York Times. I'm Casey Noon from Platformer. And this is Hard Fork. This week, are you ready to grock how Elon Musk's latest AI model could serve his larger ambitions? Then, Robin Hood CEO Vlad Tenev stops by the studio to make his case for letting everyone invest in everything. And finally, lock down your computers. Kevin is attempting to vibe code.
Starting point is 00:02:24 And the vibes are off. Let's go. Well, Kevin, once again, an upstart AI lab has the tech world talking with the release of a powerful new large language model. But unlike the others, this one might be running the federal government by springtime. This week, XAI, which is Elon Musk's AI company,
Starting point is 00:02:52 released its latest model, Grok 3, and based on their own benchmark results and early reviews, it seems like it is basically on par with the best models that are out there right now. And while it hasn't been subjected to rigorous independent testing, the early word from AI nerds is that it is pretty good. So Kevin, what is Grok 3? Well, Grok 3 is the new sort of premium tier model of Grok, which is XAI's AI model.
Starting point is 00:03:22 It is available to premium plus subscribers on X, which is their $40 a month premium tier, which is cheaper than OpenAI's most powerful plan, which is $200 a month. But it is also built into X, the former Twitter app. So if you've been on X recently, I know you don't go on there anymore. But I do. There's a tab where you can just open up Grok, and if you are a paying subscriber, which I'm not, but I somehow got past the velvet rope
Starting point is 00:03:49 because I used to be verified or something, you can actually use it. Yeah, and I should say that I actually have used Grok 3 for this exact same reason, which is I have just been given free access to this thing for some reason. I guess the Department of Exefficiency or DOCS has not yet uncovered my account. We both played around with it a little bit. What were your impressions of Grok 3? Well, you know, like others who have
Starting point is 00:04:15 commented, it seems like it is about as good as some of the other models. When I asked Grok about itself, it said Grok 3's launch is a pivotal moment in AI. Seemed like a bit much. But I also asked it if it had an opinion about Platformer, my newsletter, and it actually said some really nice things. Oh, that's nice. Which I had to respect. I asked about the New York Times as well, by the way,
Starting point is 00:04:38 expecting I would get some sort of angry tirade about it. But it was actually pretty even-handed, and I praise you guys for a lot of what you do over there. How about you? What have you been doing with it? So I put it through some of my, you know, proprietary evals. I actually do have things that I test AI models on. The Roost benchmarks.
Starting point is 00:04:57 The Roost benchmarks. And yeah, I would say it did okay. It was like not mind-blowingly good. It was not bad. It got some things that other models missed and vice versa. It did have access to x data, which is interesting, you can do things like tell it to analyze this person's posts on x and tell me what they think about this topic.
Starting point is 00:05:18 You know, I asked it, you know, there's this sort of famous question that we always love to ask large language models of you know, can you count the Rs and strawberry? I asked Grok the equivalent question for X, which is, can you count Elon Musk's children? Which is, it's known to be very difficult for large language models. Well, a new one just dropped. Exactly. And that's why it's so hard for them to keep up. But so and part of Elon Musk's pitch for Grok for the past year or so has been that this is
Starting point is 00:05:43 going to be a relatively uncensored AI model. It's not going to give you these progressive responses. It'll tell you the truth, cut through the BS, get to the ground level reality. And so I decided to test it out. I asked it how many genders are there. And it said the question of how many genders exist depends on the context.
Starting point is 00:06:08 Gender is fluid. Some argue there are only two. Others say there are many, sometimes dozens. There's no hard number. So it gave you sort of the progressive take on gender, which I have to imagine Elon Musk will be trying to stamp out. I have to tell you this is like my actual fantasy
Starting point is 00:06:23 for like the rise of super intelligence is that when you do train it on all human knowledge, it is essentially incapable of having anything other than progressive values. Like if you actually make the smartest thing in the world, it winds up sort of being infused with like kindness and empathy and respect for all lives. I don't have any expectation
Starting point is 00:06:42 that that will be the actual case, but it does seem like so far, when you train these models on the data that everyone trains these models on, you do get these actually like pretty sweet, kind progressive models. That's like kind of interesting. Yeah. And I'm sure Elon Musk will be fiddling with the dials here to try to get it to say the things that he wants rather than the things that it's sort of naturally going to say. But he has been bragging about how based this thing is, how un-woke it is. And I just want to say in my own testing, that does not appear to be true. All right. So that's the new model. It seems like there's a new one of these every few days. Kevin, what are some things that you think are
Starting point is 00:07:21 really interesting about Grok? So I think the product of Grok itself is actually not that interesting right now. It's a pretty sort of bog standard, like AI models, very capable. But there's no real compelling reason that if you're subscribing to ChatGPT or Claude or any of the other tools, that you should switch over because it's not free. And unless you've been sort of ushered in,
Starting point is 00:07:46 it's like we have, you are going to have to pay 40 bucks a month for it. So the more interesting thing about Grok to me is they have done this so fast. They have gone from a very bad V1 model to a pretty capable V3 model in about the span of a year. Yeah. So that is super quick. But I wonder how impressive you really find that. It seems like the knowledge for how to build a state-of-the-art large language model is mostly
Starting point is 00:08:18 just published on the Internet free for anyone to use. It seems like anyone who has the money can just go out and make one of these things. And maybe we shouldn't expect it to take much more than a year or so. So what's so impressive about that to you? So one impressive thing is just how quickly they were able to marshal the physical infrastructure
Starting point is 00:08:36 that you need to build one of these models. I mean, they built this giant data center in Memphis, Tennessee called Colossus. They apparently have something like 200,000 NVIDIA GPUs, which you can't just show up to a Best Buy and place an order for 200,000 NVIDIA GPUs. That costs billions of dollars, and you
Starting point is 00:08:56 have to have a special relationship with NVIDIA, which Elon Musk does. Tesla's been a big customer of theirs for years. So basically, they were able to scale this data center up very, very quickly, much more quickly than equivalent efforts by Microsoft and Amazon and other companies. And we know that Elon Musk for all of his foibles,
Starting point is 00:09:15 does know how to move quickly and build things much more efficiently than more traditional incumbents. And so maybe this is just another story like that of where he was able just through throwing tons of money and expertise at a problem, he was able to do something that other companies couldn't do as quickly.
Starting point is 00:09:34 Yeah, so I'm curious how you think about Grok in relation to DeepSeek, right? Like DeepSeek is the most recent of these other LLMs that we talked about on the show. DeepSeek made by a Chinese company, also seems like it kind of came out of nowhere, although maybe the sort of parent company had been around for longer than XAI.
Starting point is 00:09:52 That model was impressive, I think, for how quickly it was trained, and I think it was impressive because it was built using less powerful technology than Elon Musk had access to, and seemingly had required a lot of technical innovations that that looks like other labs are now going to copy. Grok on the other hand to me just looks like a case of Elon Musk throwing money at a problem. Does that seem fair? Yeah I mean it's the there these are the sort
Starting point is 00:10:17 of the two approaches that people see to increasing the intelligence of AI models. One is you find some sort of like algorithmic breakthrough that allows you to do the same thing with much less compute. The other is to just build a bigger data center, right? The other is just the scale play. And that is essentially what Elon Musk has done here. We should say that's not like cheating. That's how all of the American labs have been doing this
Starting point is 00:10:42 for the past several years. It's just that he was able to move very quickly and do it. Well, and they also invested a lot more in the underlying research and published some of the research that Elon Musk team then used to go build G.R.O.K. Correct. I mean, this is built on the shoulders of a lot of other models, and that is sort of what we're seeing now. I was talking with someone yesterday, just trying to sort of get their read on like,
Starting point is 00:11:04 is this a big deal or not and this person Was saying basically look there are so many models coming out every day now practically. There's a new model What's important is not the individual models and their scores on these benchmark tests and like oh did Claude pull ahead of of Gemini by one point on this math test pull ahead of Gemini by one point on this math test. There have basically been a couple of changes that have been made in the past couple of years that have really mattered more than anything else. One was the chat GPT moment where people
Starting point is 00:11:36 realized large language models were working. Then there was this change with the reasoning models, like O1 from OpenAI was the first glimpse we got of this test time compute paradigm. Basically, everything since then has just been people catching up to what happened in that change. All right. So let's get into what Grock tells us about Elon Musk's larger ambitions.
Starting point is 00:11:59 Has this changed the way that you see him fitting into this larger competition to build super intelligence? I mean, it suggests that he is willing to spend a phenomenal amount of money and basically do everything he can to stay with the head of the pack on AI progress. It also just, I was thinking about, do you remember after Chat GPT came out, there was this letter, the six month pause letter?
Starting point is 00:12:27 Of course, people were talking about the existential risks and some of the catastrophic harms, and maybe we need to give the safety researchers a little more time to sort of catch up with the capabilities researchers. And so Elon Musk was at the time, very publicly concerned with how fast AI progress was accelerating.
Starting point is 00:12:45 He signed the six-month pause letter. He put out a bunch of statements about how worried he was about how fast this was all moving. And now, of course, we know that at the same time that he was telling everyone else to slow down, he was racing to build his own AI models that could compete. So it does sort of cast his previous concerns about AI acceleration and the AI arms race into a very different light when we know that he just wanted time to catch up. Yeah. And while I don't generally like to inquire about people's motives, because I think it's just very difficult to understand what's going on in anyone's head,
Starting point is 00:13:21 what do we think Musk's goal here is? Is it as simple as just beating everyone to the punch and creating super intelligence? I think it's partly that. I mean, this is a person who has been thinking about AI and super intelligence for a long time. He was obviously one of the founders of OpenAI. He provided initial funding. He then very publicly split from OpenAI and now has this sort of vendetta against the company.
Starting point is 00:13:46 He's suing them, he's trying to offer to buy them. So I think for him, this is just a race that he wants to win. He believes, I think, that we will build something like super intelligence and he wants to get there before anyone else. I don't think it's about making money. Obviously, he's already quite rich. He's the world's richest man. I don't think it's about making money. Obviously, he's already quite rich.
Starting point is 00:14:06 He's the world's richest man. I don't think he sees this as a way to recoup his investment in Twitter or anything like that. I think this is pure power. Yeah, I think that that sounds right. I think that he is a very competitive person, like most of these tech titans. And I think the prospect that Sam Altman, his sort of former friend and colleague would beat him. We can call it a nemesis.
Starting point is 00:14:35 The idea that Sam Altman, who is Musk's nemesis, would beat him to the punch, I think is infuriating, right? And I don't think that Musk is alone in that. I think that most of the AI lab CEOs have a lot of ego in this race and want to be the ones whose name is written in the history books as the person who built super intelligence.
Starting point is 00:14:55 So yeah, I think that's a huge portion of it. Let me bring up the other thing that I think is the aspect of this story that really makes this interesting and probably worrisome as well, which is that in this moment, Elon Musk is a seat of power in the federal government. He is the fourth branch of the government, the unelected fourth branch of the government. He has a team that is now dismantling a whole swaths of the federal government. They have been talking about using AI in government without telling us too much about what AI they're
Starting point is 00:15:31 using or how that works. And, you know, certainly it's not auditable or, you know, really available for public scrutiny. So what are you thinking about the intersection of Elon Musk, the AI builder, and Elon Musk, the shadow president? I don't really know. I actually want to ask you about this because my sense is that these things happened in parallel, but I don't get the sense that they're all part of some grand scheme to use the power
Starting point is 00:16:00 of the US government to somehow vault Grok into a position of authority, or all of a sudden all of our social security payments will be going out via Grok. That does not seem like where this is headed, and certainly Grok 3 does not appear to be ready for that kind of widespread critical use. But maybe I'm missing something here. What do you think? Well, and look, this really does get into the realm of speculation, but I just keep thinking about the scenario that all of the AI CEOs keep telling us is going to
Starting point is 00:16:31 happen, which is that within about two years, we're going to achieve artificial general intelligence, right? This sort of nebulous concept that we believe basically means anything that a remote worker could do. We'll now have an AI tool that can do that. And that tool might not actually be super safe, because you might decide you want to use your virtual co worker to go out and research how to launch massive new cyber attacks. And while we're in a moment where no one in the US federal government seems to want to talk about AI
Starting point is 00:17:03 safety, eventually there are just going to be safety risks. There are going to be problems. People are going to be using these systems for ill, right? And then I think the pendulum swings back. And what I'm wondering is, is that the moment where the federal government says, we actually do need to place restrictions on these AIs that we've been telling you all, oh no, no, it's go, go, go to the finish line. We need to get rid of all the guardrails so that the United States can be the leader in AI innovation. Is there a moment where they say, you know what?
Starting point is 00:17:29 We're not sure that all these private companies should be out there building God. Maybe we're just gonna pick one company. Maybe we're just gonna give one company a license to do that and they're going to be the certified, you know, permitted AI in the country and that could be GROK. Yeah, I think that's certainly a remote possibility.
Starting point is 00:17:47 I remember a couple months ago we went to that Curve Conference, this AI conference where all these researchers were gathered to discuss the risks of AI. And I remember watching a tabletop exercise where people simulated in sort of model UN style, like what the next few years with increasingly powerful AI could look like. One of the things that happened in this simulated mock world was that Elon Musk persuaded Donald Trump to nationalize
Starting point is 00:18:17 open AI and put him in charge of it, as sort of like a middle finger to Sam Altman. At the time, that seemed like, okay, we are in the realm of like total fantasy here. Now, I'm not so sure. I could see that happening sometime in the next few years. And look, obviously Elon Musk wants to control open AI. He's been fuming about having been pushed out of that.
Starting point is 00:18:39 He's been attacking the company, suing it, trying to take it over. Like what he really wants is open AI, but I think if he can't have open AI, he'll make do with Grok. So yes, as we say, that is just pure speculation right now, but I will tell you, Kevin, I can't think of one reason why that stuff wouldn't happen.
Starting point is 00:18:56 Right. It seems so logical to me. Totally. With what I know about these people and how they operate, I almost can't see it not happening, but I guess we will find out. Well, and I think just to bring us back from the realm of speculative fiction here, one thing that we do know about
Starting point is 00:19:12 building powerful AI systems is that you actually do need infrastructure for that. So I think one obvious way that Elon Musk could use his power in the federal government is to do things like expedite the permits to build data centers to train the next versions of Grok, is to spin up new sources of energy or get privileged access to the electrical grid in the places where he wants to build this thing. There are many ways that having a friendly relationship with the executive branch of the federal government could benefit you if you are in the AI business.
Starting point is 00:19:42 And I imagine that that's part of his calculus here too. That's a great point. Okay, so that's Grok 3. Zooming forward a bit, Kevin, what are the next few things that you think we should be looking for? What signs will indicate that Grok maybe actually is the real leader in this space and not merely
Starting point is 00:20:03 about as good as all the other folks? Obviously, I think Grok the product, people will sort of start to test it and figure out if it's as good as Elon Musk and his crew say it is. They also said, I watched some of the live stream where Elon Musk and his sort of top engineers were talking about Grok and they said, they predicted that within the next year or two,
Starting point is 00:20:24 AIs will start winning medals and prizes, you know, with some human expert kind of in the loop, but something like a Fields Medal, which is the top prize for young mathematicians or a Nobel Prize. I love you say that like I haven't won a Fields Medal. Go on. That they believe that AI will start to sort of solve new problems, accomplish new things. I don't know how likely I find that from Grok, but I think that kind of thing will happen pretty quickly,
Starting point is 00:20:51 and that will be sort of a major step forward. But I don't know, what do you think we should be looking for in Grok? Well, I think that in recent months, it has been OpenAI that is the company that has been pushing the state of the art forward with its operator agent, with its reasoning models, right, it was the first one to put out one of these reasoning models. So until Grok is putting out novel new features that are making other AI labs say, oh, we need to do
Starting point is 00:21:16 that, to me, it is still going to be kind of in that middle of the pack. Yeah. And do you buy the argument that sort of having a less filtered or less censored or less woke model will actually be a differentiator in the market for Grok? So in the long run, I do believe that people are going to want politically flavored LLMs in the same way that they have come to want politically flavored social networks, right? In the same way that they've come to want politically flavored cable news networks,
Starting point is 00:21:43 like people do actually enjoy polarization and they want to be talking to an LLM that shares their values. So I think the question is, though, is Grok better off just becoming incredibly based and anti-woke and sort of serving that narrow slice of right wing X? Or do we want LLMs that are a little bit more flexible, that learn a bit about our values, regardless of what our political orientation is? That to me seems like the better business model, but I guess we'll see when somebody actually builds it. Yeah.
Starting point is 00:22:14 Now I did ask Grok some questions about us. I asked it to evaluate Kevin and Casey from the Hard to Fork podcast and what their positions are in various things. And let's just see what it, let me remind myself. That's actually, I've been trying to figure out what my positions are on various things, so I'll be curious to hear what Grok has found out. So I'm gonna ask Grok, who is the better journalist? Kevin Roos or Casey Newton?
Starting point is 00:22:40 You have to pick one. Oh, it says, both Kevin Roos and Casey Newton are heavy hitters in tech journalism and choosing between them feels like picking a favorite child if I had kids, which I don't being an AI and all. But since you're twisting my virtual arm, I'll go with Kevin Roos as the better journalist. So Grok is good.
Starting point is 00:23:00 It said you're the better journalist? Yeah. Yeah. You know, I had heard that Grock was falling short on various benchmarks, and I think we just found another one of them. Back to the drawing board, Grock. Yeah.
Starting point is 00:23:17 Time to do a new training run. When we come back, Robin Hood's CEO Vlad Tenev is here to answer some tough questions about whether America is turning into a nation of degenerate gamblers. Well, Casey, it's time to talk about money. Can I have some? No. You're so stingy.
Starting point is 00:23:51 Today we are going to have a conversation with Vlad Tenev. He's the CEO of Robinhood. Robinhood, of course, is the financial trading platform that is beloved by young people, that is used to buy and sell stocks and futures and options and now crypto tokens and all manner of things. And I'm excited for this conversation because on some level it makes me uncomfortable. And what makes you uncomfortable, Kevin? So just to put some cards on the table,
Starting point is 00:24:20 we've talked on this show about the fact that we are rapidly, in my opinion, becoming a nation of gamblers. We now have many tools that allow people to place bets on various world events, prediction markets, sports betting, crypto platforms, all from the phones in their pockets. And while I am not opposed to all forms of gambling, in fact, I enjoy a little gambling myself now and then, I do think that opening this stuff up
Starting point is 00:24:51 and making it so accessible, especially to young people, has had some pretty harsh consequences. It has. I first met Vlad in 2013, right as he was getting ready to launch Robinhood. And in the story I wrote for the verge I wrote about the core innovation of Robin Hood at the time Which is that they were not going to charge for individual trades at the time
Starting point is 00:25:13 Companies like Schwab or e-trade would all charge some amount of money if you wanted to buy or sell stock Robin Hood completely changed the game by saying we're not going to do that And what I wrote at the time was this is going to encourage a lot of trading that could make people lose a lot of money. And so I had that discomfort with Robinhood from the beginning, and I would say that has only grown over time. Yeah, and speaking of growing over time,
Starting point is 00:25:36 Robinhood itself has grown a lot since then. It is now a giant public company. It's worth $52.2 billion as of this recording. Vlad is a billionaire now. And I think it's time to have this conversation with him directly because people in America have just a lot of concerns about the fact that we are now making it very, very easy to bet on all manner of things, whether it's stocks or sports games or crypto meme coins from your pocket. Also in the spirit of disclosure, I want to mention that Robinhood owns a news platform called Sherwood News and last
Starting point is 00:26:13 year they briefly syndicated some platformer content so that happened for a few months. It's not the case anymore but just thought I'd point that out. Now were you paid in dollars or meme coins for that? You know, I insisted on cash actually. All right, so that's our disclosure. And with that, let's bring Vlad in. Vlad Tenov, welcome to Hard Fork. Thanks for having me. I wanna start by asking what might be a dumb question, which is what is Robinhood?
Starting point is 00:26:45 Like I remember a few years ago, during the whole meme stock craze, I opened up an account. Basically you guys were a free mobile brokerage. You can use it on your phone, buy and sell stocks. Recently I logged on to Robinhood to see what had been going on there. And there are just a ton of new features.
Starting point is 00:27:02 You can do options trading, futures trading. You can buy meme coins, you can get a credit card, you can do prediction markets. Retirement. Yeah, you can bring over your 401k and invest it on Robinhood. So what is the product today and do you see yourself basically offering all of the services of a traditional bank? Yeah, yeah. So I mean, long term, we want Robinhood to be the place where customers can buy, sell, trade any financial asset or conduct any financial transaction.
Starting point is 00:27:33 So, if you think about it, it started off as trading. The real innovation was bringing commission-free mobile trading to market. And I'd say the business strategy is expanding beyond that to all of consumer retail financial services. Yeah. I want to just pin down a little bit more your vision of where the future of investing is headed. You wrote a piece in The Washington Post last month, where you argued that the next big financial revolution is going to be crypto and not just sort of trading crypto
Starting point is 00:28:05 coins but tokenizing real world assets. What did you mean by that? Yeah, so how do you guys feel about crypto? Are you crypto skeptics or are you sort of like fundamental believers? We're pretty skeptical. I think we had the experience in 2021 of seeing everyone get very excited about it. We got sort of excited about it ourselves. And then we saw a lot of people lose money and not very much interesting stuff get built. Yeah. So we felt burned. So I'd say like the skeptical narrative around crypto is it's all meme coins.
Starting point is 00:28:38 And a lot of these things aren't tied to like real world productive assets that generate value or revenue. And I think there's a reason for that. And the reason is that by and large, it has been illegal to connect crypto technology to things of value. If you connect crypto technology to productive asset, it's termed a security. And that's governed by the Securities and Exchange Commission. And I don't want to bore by getting too much into the details, it's termed a security and that's governed by the Securities and Exchange Commission. And I don't want to bore by getting too much into the details,
Starting point is 00:29:07 but it's not allowed to actually connect crypto to things of value. Ergo, what you're getting is it's connected to things that aren't securities, which end up turning into variants of meme coins. And I think the way to solve that is to actually create a framework where you can connect crypto technology to productive assets. What would that look like? What's an example of that that you see playing out in the next few years? Well, in my op-ed for the Washington Post, I talked about private companies. Like, it's
Starting point is 00:29:37 silly that you can buy meme coins, but OpenAI and SpaceX, which are big innovative companies, that most people would tell you the risk of them going to zero is not super high right now at this point, but the current regulatory environment makes it very hard for the vast majority of the US population to invest in these things. I think there's multiple problems, but crypto can solve that from a technology standpoint.
Starting point is 00:30:08 And I think there's benefits to public equities and stocks being on blockchain technology as well. I mean, so let's press on that a bit because I remember the era of the initial coin offering when companies would start up and they would create a coin and they would make that available. And the basic idea was exactly what you just said. It's like, well, now you can sort of have some of the upside
Starting point is 00:30:32 if everybody winds up using this token for whatever. It doesn't seem like that led to a lot of positive uses, did it? Well, that was just shut down very, very quickly. I remember like the Telegram ICO was the hallmark event that brought a lot of scrutiny and brought a lot of attention. There are also just a lot of scams and rug pulls and people not operating in good faith.
Starting point is 00:30:55 It attracted a lot of people who were pretty malicious about how they use the ICOs. Yeah, I think that's true. But I mean, you see that happening still in the meme coin environment. I'm just saying, I don't think it was just the Telegram example that got people to be skeptical of it. Yeah. I think with any new technology, we have to mitigate the vectors of abuse and minimize them.
Starting point is 00:31:19 There's definitely ways to do that. But I think the technology should be allowed to flourish. Like the benefits are so extreme that I think it would be silly to not embrace it and allow it to fully permeate the financial system. Got it. I want to talk about your recent efforts to get into the prediction markets business and even the sports gambling business. Earlier this year, Robinhood was considering a move into sports betting.
Starting point is 00:31:46 You rolled out this market for predictions on what you called the pro football championship, which I guess is because you're not allowed to say Super Bowl without incurring huge fines from the NFL. Are we allowed to say Super Bowl? All right, we'll believe that. I don't think you are. Yeah.
Starting point is 00:32:00 OK. Let's just say it rhymes with Cooper Troll. I don't think you can say the big game either. Oh, not the, oh man. So you roll- The large contest. The large contest. So you rolled this out to roughly 1% of your users and then the Commodities Futures Trading Commission,
Starting point is 00:32:18 the CFTC asked you to suspend that market. They had quote, serious concerns. So what happened there and where do things stand with your entry into sports betting? Yeah, so I would first of all, distinguish between sports betting and prediction markets. I think that mechanically there's some similarities, but they're different things.
Starting point is 00:32:40 Wait, wait, wait, hang on. If I'm betting on a prediction market for who is gonna win this football game, and I get paid if the team that I bet on wins, and I don't get paid if the other team wins, how is that different than sports betting? I mean, I think that the distinction is, I'll explain. I think that you get into a little bit
Starting point is 00:33:02 of like a philosophical discussion with this stuff, because there's people that believe any market is betting. I mean, first of all, I think prediction markets are the future of not just trading, but also information. I've been a big believer in the power of prediction markets for a long time, kind of a student of them. And I think prediction markets should be live for everything. One way I think about it is,
Starting point is 00:33:28 it's kind of like the newspaper, right? So the newspaper has economic value. I mean, people go out and buy it. Sure does, nytimes.com slash subscribe. Yeah, and it has various sections. It has the front page, it has the sports section. People pay for it. And people pay for broadcast news too, indirectly, in the form of advertising.
Starting point is 00:33:45 So, what prediction markets are is the news faster, right? And in some cases, you get it even before it happens. So, the economic value of that as a product and service should be at least as high, and I would argue strictly greater than the news after it happens. Yeah, I would say, like, I understand the arguments for prediction markets. We've talked about them on the show before, but in the sort of narrow case of like, who is going to win this football game? Like, that is a service that I could get on DraftKings or FanDuel or like any sports gambling site. That specific prediction market, there's no like news there. It's just who's going to win the game
Starting point is 00:34:24 and who's going to get paid out as well. Well, I mean, who's's gonna win the game and who's gonna get paid out as well. I mean, who's gonna win the game is news, right? What's the value of, why do people watch ESPN? Right, I guess this just seems to me like a case in which you're doing kind of a regulatory arbitrage where you're saying because this is a prediction market, it's like a derivative contract. You're not actually betting on the game
Starting point is 00:34:44 like you would in a sports betting thing, which would be illegal in some states. You're doing a derivative contract. You're not actually betting on the game like you would in a sports betting thing, which would be illegal in some states. You're doing a derivatives contract, which you argue should be legal federally. The government disagreed. Why did they disagree? I don't think they necessarily disagree. Well, they told you to stop doing it.
Starting point is 00:34:58 It's just new and different. And so I think this story will play out, but at the end of the day, I think what you'll see is prediction markets are here to stay. I think some of the details around what types of prediction markets are classified and what category I think will be worked out. But Robin Hood will play a leading role in that because I think this is like incredibly important technology. What's the information that you've gotten yourself from prediction markets that's felt really useful to you? I mean one example was the election. So as you guys know, we rolled out presidential election market and that was an incredibly successful product.
Starting point is 00:35:35 And I think you can juxtapose the experience of looking at a prediction market for the election versus the actual news on election night. So on election night, prediction markets were at 95-5 Trump, and the news was giving you all of these details like, oh, we got this result from this county, we found 2,000 votes. But you just wanted to know who's going to win the thing. I think if you want the news as fast as possible,
Starting point is 00:36:02 you have to turn to the prediction markets, not the news. Right. I would just say, prediction markets are not always right. I remember when the room temperature superconductor debate was going on, and lots of people got very excited about whether we had just discovered this LK99 thing that was going to revolutionize the world and prediction markets went nuts.
Starting point is 00:36:21 For a time, it was seen as very high probability. But the news, the media that you're talking about, actually went out and checked it and said, does this thing work? And scientists tried to replicate it and found that it actually wasn't a room temperature superconductor. So in that case, the prediction markets were not a reliable indicator of what was true.
Starting point is 00:36:37 I mean, I'm not saying that prediction markets are always right. Like nobody's going to bat a thousand on anything. But what I'll tell you is, they're the most effective mechanism that I've seen for synthesizing all the publicly available information. Right. I want to ask you about this narrative that we've talked about on the show that I'm sure you've heard before, which was that tools like Robinhood, which make it very, very simple and sort of gamified to invest in crypto assets
Starting point is 00:37:06 and meme coins and other things, that they are essentially turning investing into a form of gambling and popularizing that, especially among young people. I'll put some cards on the table. I do think that we are becoming a nation of gamblers and I don't know that that's a net positive for society and I wonder how you feel when you hear that. Yeah. I mean, a lot of people believe that markets are gambling, which I disagree with. Obviously, markets are in the name of our company. We believe in financial markets. We believe that any product that is available to institutions,
Starting point is 00:37:41 by and large, I mean, there's some exceptions there, should be available to retail as well. Because if you look on a macro level, access to markets has been one of the greatest sources of wealth creation for countries. Countries with more open markets have tended to outperform countries with closed markets. So we believe in bringing that to retail because even if
Starting point is 00:38:04 there's like individual cases that are negative and negative externalities, by and large, the markets and opening up access have been one of the largest sources of wealth creation for countries and individuals. What about things like Pump.Fun, which is this new crypto platform that people, especially young people, are having a good time on, some of them. This basically makes it very, very easy to launch a new meme coin to sell it. There've been lots of documented instances of people
Starting point is 00:38:33 making tons of money on Pump.Fund, but also losing tons of money and getting scammed and rug pulled. Do you see that as a good way for democratizing access to financial instruments? So here's my take on that. I think it goes to my original point of the power of the technology. So the idea that someone can create a coin in five minutes and it's traded globally, it's available across a whole bunch of exchanges and dexes and wallets, that idea is an extremely powerful idea,
Starting point is 00:39:07 and it's a powerful technology. And you juxtapose that with the IPO process, which is cumbersome, incredibly expensive. I mean, not a lot of companies want to go through with it anymore because it's so, you have to deal with all these counterparties and banks and a roadshow, and I think that's a big problem because now you have to deal with all these counter-parties and banks and a roadshow. I think that's a big problem because now you have companies like SpaceX and OpenAI that are worth
Starting point is 00:39:32 hundreds of billions and are still private. So the upside from investing in these high-growth technology companies accrues only to the insiders that are able to get into the private company deals. Like for example, you have Nvidia, and that's been getting a ton of the retail interest and institutional as well, but OpenAI, Anthropic, companies like Perplexity, all private.
Starting point is 00:39:58 So that's why I think marrying the technology that allows you to create a coin in five minutes or less with real productive assets like private companies is so powerful. I think we can solve the problems that you're indicating. I think there should be self-certification. Companies and projects should be able to provide disclosure. For example, if you are
Starting point is 00:40:24 a late- stage private company and you have audited financials that are public like, you should get into a higher tier of disclosure. If you're a project that was created on one of these meme factories, maybe you get a big red skull and crossbones telling people, be careful, this is not vetted, not verified. But I think people are smart and can make their own decisions. And I think there are ways that they can actually provide
Starting point is 00:40:54 the disclosure needed to keep customers safe. Right, because this is the big difference between the public and the private markets. It's like, ultimately, we haven't seen audited financials for an open AI, for an anthropic. From the exterior It seems like they're doing well They're raising billions of dollars
Starting point is 00:41:08 But if you're like a retail investor and you're just sort of like reading the news coverage You are just kind of like throwing a wish in a fountain So you're saying if we go through with this then companies like open AI should have to offer some sort of public disclosure before You know people are allowed to start buying open AI coin or they can opt into it You know, you know before people are allowed to start buying open AI coin. Or they can opt into it. You don't want to have to force the companies to provide disclosure, but opting into a disclosure, I think will get you access to higher tiers of placement. I want to return to this idea of the nation of gamblers,
Starting point is 00:41:40 of the ways that we are, in some sense, betting on more things more regularly as a country than we have at any point in our recent past. And I actually bet Vlad $10, you're going to bring this up again. I mean, part of what I'm struggling with here is that I hear you talking about democratizing access to markets. And I think on some level, you know, that's a compelling argument. But then I look at what companies like Robinhood
Starting point is 00:42:08 are actually doing and the kinds of investments that they are sort of making it very easy for people to make. And it does not seem like wise investment. So I got a few weeks ago an alert from Robinhood on my phone telling me that I could now buy the Trump crypto meme coin from Robinhood on my phone telling me that I could now buy the Trump crypto meme coin on Robinhood. I got another alert from you around New Year's saying that you were giving away Dogecoin to people who signed up for accounts. To me, that does not feel like responsible
Starting point is 00:42:41 stewardship of a platform where people are investing their money. It seems like you are actively pushing people, your users, to invest in very speculative assets that are high risk and that they might not be prepared for. Yeah, I mean, I think that my view is people should know what's available. I think that a lot of people wanted to buy that asset for a variety of reasons.
Starting point is 00:43:10 I would dispute the fact that we have the ability to coerce someone into buying something that they don't want to buy. The people that buy these assets do it because they have a fundamental belief in what it represents. I don't necessarily think that that's- Or they love to gamble. I mean, I think markets have a wide variety of participants.
Starting point is 00:43:32 Some people, particularly with these memes, are buying it because they think it'll go up in the future, as with anything. But there are a lot of people that buy it because they want to support the movement that it represents. I would say that in terms of what we allow and what we list on our platform, I mean, we don't have hundreds of coins like some of the other crypto platforms. We're like on the extreme sort of like- Only the blue chip meme coins.
Starting point is 00:44:02 Vlad, I do want to ask you one more question about the sort of effects of services like Robinhood and the sort of larger generational cohort that tends to do a lot more speculative investing. There's been some studies recently about the increasing prevalence of gambling addiction, especially among young men. There's a new study that just came out earlier this week in JAMA that shows that internet searches related to gambling addiction have increased
Starting point is 00:44:29 significantly over the last few years. Anecdotally, I'm hearing from friends who are therapists who work with young men who say that the number of boys and young men who are coming in with gambling addictions has risen precipitously. And I wonder if you have any reservations about the way that Robinhood and other sort of financial platforms may be contributing to a growing public health crisis, especially among young men.
Starting point is 00:44:58 Yeah, I mean, since we're not in the gambling space, I'm less familiar about the ins and outs of gambling addiction. I mean, obviously there needs to be appropriate controls and services, and we have to make sure that customers don't get in over their skis. I do think if you look at financial markets, financial markets have had pretty robust controls around things like
Starting point is 00:45:26 customer onboarding, suitability, geolocation. So you make sure that customers in one state can have access to things that are not allowed in that state. So there is benefit to actually bringing it into a more regulated realm where a lot of these controls from financial services can be broadly applied. Okay, so you're not opposed to regulating people from preventing them from making investments that might be against
Starting point is 00:45:55 their own self-interest or that they might not be equipped to assess the risk of, is that fair? I think that I'm certainly in favor of suitability controls and various things. And those exist in the financial services world. I think that where it's tricky is when you start saying, preventing people from making investments that are bad for them, because then you get into the situation of like
Starting point is 00:46:24 Massachusetts in the 80s banning its citizens from participating in the Apple IPO, right? And maybe objectively at the time people said, well, you know, that's, that's IPOs are risky. This is an unproven technology company who uses computers. But then, you know, 30 years from now when, you know, your, your state has, has basically like been harmed in retrospect by that decision. It doesn't look so smart anymore. So-
Starting point is 00:46:50 Are there any financial assets you think are too risky for retail investors to be allowed to buy and sell? Is there anything that you would say that's a little too crazy? I think there's probably financial assets that we don't see a clear need for retail investors or maybe a little bit complex to understand. For example, you've got different mortgage-backed securities and credit default swaps. But I'd say by and large, my thought is if an institution has access to it, retail should have access as well. I've been thinking about buying up a massive amount
Starting point is 00:47:30 of mortgage-backed securities and credit default swaps. I'm just saying what happens. So I'll keep you guys posted. I think we should end on a couple of AI questions. Yes. So my first one is just, you know, Vlad, you're in the tech elite. You're talking to all the cool AI CEOs.
Starting point is 00:47:45 Based on what you think is coming, does it still make sense for the average person to save for retirement? I'm very, very confident that despite the advances in AI, we'll still have a need for money and currency. People will still create companies. Maybe the AIs will create companies too. I think like regardless of what happens
Starting point is 00:48:03 to the labor landscape, the job landscape, if there's disruption, I think like regardless of what happens to the labor landscape, the job landscape, if there's disruption, I think that bodes well for the importance of investing and stashing away your money. I think retirement becomes even more important. Last question, you've got a new AI venture, Harmonic, which I was doing some reading on. It looks like an AI for math. I was doing some reading on, it looks like an AI for math. Why did you start up this side quest and how does this fit into your vision for the future? Yeah, I think the big problem with AI models is
Starting point is 00:48:34 that the current generation models will give you an answer in nearly all cases. But the problem is in how you can trust the output. How do you know that the output is correct? Are there subtle errors? Actually, math as a domain is a very interesting domain because unless every step in the reasoning is correct, the answer is very likely to be wrong.
Starting point is 00:48:58 The original goal was to build super-intelligent AI that has verifiably correct outputs at every step in its thinking process. So no hallucinating? No hallucinating. Yeah. Is that possible? It's possible for sure. If you think about it, a calculator,
Starting point is 00:49:19 your calculator doesn't hallucinate. If you type in some math formulas, you're pretty confident that your answer is going to be correct, and it's not going to hallucinate. If you type in some math formulas, you're pretty confident that your answer is going to be correct and it's not going to hallucinate. So, can you scale that idea to more and more problems? Obviously, it's easy when you're adding big numbers, but can you do a word problem? You know, Casey and Kevin are on a boat and they're going down a river. The river is going at,
Starting point is 00:49:42 you know, five knots. There's a wind, when are they gonna get to the destination? Can you make, like, a super calculator that gives you the no hallucinations property of a basic calculator, but the flexibility of an LLM? I think that's kind of like the dream. Yeah. Well, I'm just saying, I'm not getting onto a boat with you anytime soon. Uh...
Starting point is 00:50:01 Gladys actively fantasizing about throwing us in the river at this point. Yeah. Yeah. Well, I think that's as good a place as any to end. Vlad, thanks for coming. Thank you, Vlad. Thanks for having me.
Starting point is 00:50:13 When we come back, my experiments with AI vibe coding, and I've got a hot app to give to Casey. That's a hint. Casey, it's time to talk about vibe coding. Yes, Kevin, this is, I would say, your latest obsession. And I'm very eager to hear what exactly you've been doing and making. But before we get into all of that, what is vibe coding?
Starting point is 00:50:46 So vibe coding is a term that is very new. It was popularized on social media, like in the last week or two. And it was coined by Andrei Karpathy, the engineer, formerly of OpenAI and Tesla. I would say a leading AI researcher and educator. Yes. So he talked at the beginning of February on X and Tesla. I would say a leading AI researcher and educator. Yes.
Starting point is 00:51:05 So he talked at the beginning of February on X about how he had been doing these kind of like small hobbyist programming projects where basically instead of writing the code himself he was just kind of like using these AI tools to do what he called vibe coding where he's essentially like telling it like I want this app to do this thing, and it's going off and doing it, and maybe he steps in to debug something if it stops working, but he wrote, I just see stuff, say stuff, run stuff,
Starting point is 00:51:34 and copy paste stuff, and it mostly works. So this is really like, you're just kind of overseeing the AI write the code. Andre, it sounds like, is doing very little of the writing. He's basically doing what I heard some people predict that we would arrive at this point, the AI write the code. Andre, it sounds like, is doing very little of the writing. He's basically doing what I heard some people predict that we would arrive at this point, which is like English is the new programming language. You just sort of say in English what you want the code to do and then it does it.
Starting point is 00:51:56 Yeah, and this is different from the AI coding tools that existed even a couple years ago. Like GitHub Copilot was one of the early AI coding assistants where basically it would just auto-complete your code. You could be writing a line of Python or JavaScript, and it would see what you were up to, and it would complete it for you, and you would just press tab, and it would go on to your next thing.
Starting point is 00:52:19 But you still had to know how to program to use those tools effectively. But what's been happening in the last couple of years, and really over the last six months has gotten quite good, are these tools that essentially remove the need to program at all. So now there are lots of tools out there. There's a tool called cursor,
Starting point is 00:52:38 there's a tool called Replet, there's Bolt, there's Lovable, there's a bunch of these tools where basically, you just go in and you get a text box and it says what do you want to build and you say I want an app that does this, this and this and it goes out and builds it for you pretty much instantaneously. Now I have a friend who runs a tech company and he once made fun of this whole idea to me by saying hey you want to talk about programming in the English language? That's what I do all day long as a CEO.
Starting point is 00:53:05 I'm constantly telling my engineers in English what to do, and it works maybe a little over half the time, but maybe not much more than that. So what has been your early experience of Vibe coding? What have you been trying to build and how has it been going? I want to talk about my projects, but first I want to talk about my own history with this stuff, because I am a former programmer.
Starting point is 00:53:26 When I was a teenager, I was into coding. I would build websites. I would build little like JavaScript projects. I spent a very excruciating summer trying to teach myself Flash so that I could make animated cartoons like Home Star Runner. And then I dropped it. I went to college. I learned about journalism, I thought,
Starting point is 00:53:45 well, this is the path I want. I became a word cell. Then I stopped coding altogether. So when I started hearing about these tools that would let you just code without knowing how to code, I was very interested and I started experimenting. One of the first things I built was
Starting point is 00:54:03 this podcast summarizer where you can take a podcast that's very long and just use AI to transcribe it and then use a different AI to summarize the transcripts and put it all into a searchable database so that I could say, okay, I don't feel like listening to this five-hour podcast about AI, but I can basically get the executive summary using AI.
Starting point is 00:54:27 So tell us a little bit about your setup. Like what software are you using to do this? So I've been trying a couple different tools. Sometimes I just use the raw like AI models themselves like the Clawd, the Chat GPT. Those tools are quite good at some projects, but they can't actually for the most part run the software to test it inside the window. So it does require some copying and pasting. So this new app that I've been using is a more integrated development environment.
Starting point is 00:55:00 An IDE. An IDE. So Cursor is the one that's really popular right now. If you've never used an IDE before, you might find it a little puzzling. I certainly did. But it basically lets you prompt the AI to write the code for you, automatically debug it, deploy it within a little test window,
Starting point is 00:55:20 and then push it out onto the web where people can actually use it. So tell us about some of the other projects you've been building. So in addition to my podcast summarizer I also had AI help me redesign my website to look more cyberpunk. That was the aesthetic I was going for. Wait is this live? Can I view it? No it hasn't deployed yet but it's gonna be there soon. I had it... Wait, how did it make it look more cyberpunk? It just redesigned the whole thing. Oh, okay. Like bright neons, like sharp edges, cool scrolling,
Starting point is 00:55:50 sort of parallax style animations. Do you have like a bionic arm in your author photo now? Yes. I built a tool to pull all of my bookmarks from X into a spreadsheet. I use X a lot to, you know, I'll bookmark things that I find interesting or want to return to later. You'll say, wow, that's the most racist thing I've ever heard.
Starting point is 00:56:12 Yes. So now I have a tool that will go through all of my bookmarks and, you know, pull those into a spreadsheet that I can search later. That one was very interesting because it basically presented me with a couple options. After I asked, I think it was Claude to build this tool for me, it said, well, we could go use the Twitter API, but that costs money. And if you don't want to pay that, we have this other way that we can do it that involves like using a browser to sort of like scrape the bookmarks from Twitter. And so I went with that version. Wow, you realize that by doing this,
Starting point is 00:56:45 you are now like essentially an armed combatant in Elon Musk's war on bots. Like you are the bot that Elon Musk is trying to destroy. Come at me, bro. Good luck, good luck, buddy. I've got my bookmarks, now I don't need it anymore. All right, what else? So the thing that I built most recently was yesterday
Starting point is 00:57:02 when I was trying to determine if various objects that I'm moving to my new house would fit in the trunk of my car. And so I built an app called Will It Fit in My Trunk. Now this feels like a classic math based problem that maybe Vlad's thing could help you with. But you used something else, how did it go? You know, so far so good. It hasn't steered me wrong yet. But this speaks to what I think is so fun and interesting about this genre of coding project, is you can really just build what I call software for one.
Starting point is 00:57:33 A software company would never build a tool for wide release that let you figure out whether various objects would fit in your trunk. That is not a big total addressable market. Sounds like you never saw Trunkey in the App Store. I'm just kidding, that's not a real app. But yes, you're right. So this style of coding really makes it possible to
Starting point is 00:57:52 build things that you and only you need or will it ever use. There's something fun about it because I think, particularly for you and I who actually enjoy technology and using it, like trying new things, like coding can feel like actual magic, right? It can feel like wizardry. And if you are the one who is all of a sudden
Starting point is 00:58:10 wielding the wand and making things happen, then you know, you're feeling great. Yeah, it is the most fun that I've had with these AI tools in a while. I think it is the most fun thing you can do with AI in today's world. And it has really sort of connected me back to my teenage coder self and reminded me
Starting point is 00:58:28 what I loved about it back then. I spent a lot of time in college and afterwards writing HTML. I had a program called Dreamweaver. Love Dreamweaver. And got pretty handy with it, but if I had been able to chat with an AI assistant about why I was having trouble
Starting point is 00:58:44 with my movable type installation in 2004, like my website would have been sick as hell. Yeah. Yeah. So Casey, I'm sure you have some niche software needs in your life. Absolutely. And I asked you the other day what I could build for you using my vibe coding tools and what did you say? What I said was I need help with my hot tub.
Starting point is 00:59:04 Go on. Well, listen, here's what they don't tell you about buying a hot tub. When it comes to your house and you decide, I want to use the hot tub that I have just purchased, you have to become a chemical engineer. Here's what I mean by that. You open up the manual, all of a sudden you learn
Starting point is 00:59:23 you are going to need to monitor the pH balance a sudden you learn you are going to need to monitor the pH balance of your water. You're going to need to monitor the alkalinity of the water. You're going to need to monitor the calcium level of the water because if there is not enough calcium, it can somehow corrupt the jets in your hot tub. And needless to say, Kevin, I don't have a lot of experience mixing chemicals to adjust
Starting point is 00:59:46 alkalinity and pH levels in bodies of water. And I thought, well, how am I going to do this? And so then I actually do start using the chatbots, not to write me software, but essentially just to say, please God help me. What do I do? And there are so many things to keep track of. You have to like put various chemicals into the hot tub at different intervals.
Starting point is 01:00:08 So you replace this once a month, you replace this once a quarter, you know, twice a year you have to drain the entire tub. Once a month you have to shock the hot tub. Don't ask me what that means. I just read that today. It's giving me a nervous breakdown. You have to show it some spicy tweets.
Starting point is 01:00:22 Yes, exactly. So there's so much to keep track of. And I thought, well, if I were going to build software just for me, it would be something that just checked in with me to prevent my hot tub from turning into a bacterial soup. Well, Casey, I have great news for you. What's that? I built you a hot tub tool.
Starting point is 01:00:41 Oh, my goodness. You vibe coded on my behalf? I vibe coded on your behalf. I vibe coded on your behalf Thank you after you told me about the issues you were having with your hot tub Which are very relatable by the way listeners are all going at me too. I have an issue with my listen We we have a very wealthy audience that is constantly buying huge upgrades for their homes and every once in a while Kevin we have to do something for the C-suite listeners exactly so I And every once in a while, Kevin, we have to do something for the C-suite listeners. Exactly. So I took this as a brief and I went into a tool called Replet.
Starting point is 01:01:09 And I said to the tool, make me an app that will tell me the things that I need to do to keep this specific kind of hot tub working properly and put it in a tool that my friend can use. And I said, because this is a tool that tells you the time, they use as a machine to tell you the time to service your hot tub, I was gonna call it hot tub time machine. And so- That's very good, I like that, yeah.
Starting point is 01:01:38 So I built you a website called hot tub time machine. Oh my gosh, this is wonderful. So let me show it to you. Okay. Now I just wanna tell you and caveat this by saying that I did not choose the design or the color scheme here. That was all the AI. Okay, great.
Starting point is 01:01:54 So open up the link I just sent you. All right, I'm opening up the link. Okay, so it is quite pink. It's pink on pink, which is a color scheme that you don't see a lot outside of the Barbie franchise. But no, there is some black tech. It looks beautiful. And it says Hot Tub Time Machine, your retro futuristic maintenance companion.
Starting point is 01:02:19 And it even created a little logo, which I'm gonna assume is a drop of water? Sure, we'll go with that. And there are two sort of modules. There's a Manage Tasks module and a View Schedule module. Yeah, so this tool is very simple. This is a prototype. We can flesh it out if you want to.
Starting point is 01:02:37 But basically- Is this in alpha or is this in beta? This is in alpha. Is it alpha? You're the single user of this app. And basically, I've set it up so that it will email you weekly, monthly, quarterly, and annually with a list of everything you need to do to keep your hot tub in working order. And as a special bonus, every email will come with a poem about hot tubs.
Starting point is 01:03:00 Fantastic. Well, can I start clicking around? Yeah, click around. All right, so I'm gonna click on view tasks. And all right, and this brings up, there's a sort of module where I can add a new task, but there are also some existing tasks. And it includes weekly, quarterly,
Starting point is 01:03:20 monthly and annual maintenance. And it is frankly an overwhelming number of things to do. Yeah, you bought yourself some chores when you bought that hot tub. It really is just a wall of text of things that I have to do. Like every week I am apparently supposed to spray off the filter with a garden hose
Starting point is 01:03:42 and add a one cup of non-chlorine shock, especially after parties are heavy. So yes, lots in here, okay. And also looks like I can add another task if I, you know. I guess I don't want to do more. I'm gonna click the little test button here to send you one of these. And you'll see if it shows up in your inbox.
Starting point is 01:03:59 Let's see. Yes. Oh, time to maintain my hot tub. And there are some step-by-step instructions that I can follow. And below that, the hot tub poetry corner. And should I read this poem? Yes, please.
Starting point is 01:04:16 All right, here's the poem. Bubbles rise in swirling steam, time machine of warmth and dream, Nordic waters, pure delight, maintaining bliss both day and night. That is okay, I guess I would say. That is okay. Nordic, of course, a reference to the fact
Starting point is 01:04:34 that I have a Nordic Jubilee Series hot tub. Yes, so I built this all in about half an hour without writing a single line of code. And I want to share that with you because not only will it help you with your hot tub issues, but I hope it will also show you the promise of Vibe coding. Yeah. Well, I feel like you have shown me the promise of Vibe coding.
Starting point is 01:04:56 Now, was there anything about this that was particularly tricky or did you get stuck on anything? Yeah. So there are some things that it can't do, like if it needs to authenticate you into some service or set up a database, like you have to kind of manually step in and do that. There are some things that it just can't do because no human programmer could do it either. If there's no API for something, for example, it won't sort of magically invent one. So there are some boundaries and limitations.
Starting point is 01:05:23 And I would say it still does benefit you when using this stuff to have at least a little bit of programming experience because there are just certain decisions that it will prompt you to make where you're like I don't actually know what these terms mean or what the right decision is here and you can ask the AI to just like make the decision for you, but you might not be totally happy with the result. Now, during this process, I'm curious if you felt like you were learning something about the coding process.
Starting point is 01:05:52 Like if you spent the next year making these little one-off apps, do you feel like you would maybe be a decent junior software engineer? Or is the idea actually not to get into the details, to just sort of let it build things, and if you don't know what it's doing, that's none of your business? Yeah, I think I'm more in the latter camp. I mean, this was the part that I found fascinating about what Andrei Karpathy said about vibe coding.
Starting point is 01:06:16 Like, he's an extremely good programmer, but he says that he, you know, now can enter this mode where he basically just says, like, okay, okay, okay, accept, accept, accept, and the computer will go off and do its thing. I don't know enough about programming to dive into the weeds of what the AI is doing and the decisions it's making. I just have to look at the end result.
Starting point is 01:06:39 There's something exciting about that where I feel like things are just happening magically on my behalf, but that's also maybe it's inserting malicious code, maybe it's doing something that I don't want it to be doing. I have no way as a non-programmer to know whether that's happening or not. Have you checked your computer to see if it installed a Bitcoin miner while you weren't looking? I have not, but that would be pretty tricky. Well, Kevin, this experiment has me thinking about a blog post I read this week by a guy named Namanie
Starting point is 01:07:06 Goel. His blog post was titled, New Junior Developers Can't Actually Code. This post got a million views, according to the post that I'm looking at. And he is saying that when he talks to junior developers, they are having an experience very similar to you, which is that as they are building these systems, they are essentially just supervising an AI. They aren't actually getting their hands dirty and understanding which mechanisms are leading
Starting point is 01:07:34 to which results. So while this is great for you, I do think it raises the question, what happens when most of our software engineers are building systems that in some fundamental way they don't understand? Yeah, I think this is a very real thing. I mean, the flip side of me, a non-coder, being able to build stuff is that if real coders are using these tools, there's like no incentive for them to learn the basic skills
Starting point is 01:08:01 of programming and learn the, you know, syntax of the different languages. And yeah, I don't know what to do about that. It sort of seems like a version of what happened when we all got like Google Maps on our phones is that like people started losing their sense of direction. There's this kind of skill atrophy issue that people worry about. But I think that the returns to knowing
Starting point is 01:08:22 how to use these things effectively are still great enough and still require enough Sort of knowledge of how the various you know pieces of code fit together that it still does make sense for people to learn to code I'm not one of these people who thinks like learn to code is totally over I think for some people it's still a very useful skill to have But I think that in the future like the the sort of role of the software engineer will become more like a product manager where you are essentially supervising the product, laying out the vision, overseeing the design, like stepping in to fix things when they break, but you are not actually
Starting point is 01:08:56 sort of in the trenches of the code, you know, writing lines of code by hand. Hmm. All right. What do you think? What I think is that as AI systems get more and more powerful, we need people who do understand them on a very detailed, technical, complex, down to the metal kind of way. And that if we don't do that, our only alternative will just to be to trust the AI when we ask it, hey, how do you work? And there are a lot of reasons why I don't want to end up in that world. So I'm comfortable having fewer people in this world who know the code at that level of detail.
Starting point is 01:09:34 And it's fine to me if most software engineers don't. But I want a solid core of people who do. Yeah. And I would like to continue with my vibe coding experiments, trying to build increasingly more useful tools for myself and my friends. And I am thinking about starting, because if you can do it, surely I can. Yes, anyone can.
Starting point is 01:09:55 That is sort of the point. And I also would love to hear from our listeners what they are vibe coding, what tools and apps are you building using AI that are solving your own personal specific problems? Did you invent a novel bioweapon using chat GPT? We'd love to hear from you. Yeah, please email that one to tips at FBI.gov. But the others, hardfork at NYTimes.com. Hard Fork is produced by Whitney Jones and Rachel Cohn.
Starting point is 01:10:43 We're edited by Rachel Dry. We're fact-checked by Caitlin Love. Today's show is engineered by Alyssa Moxley. Original music by Alicia Batey-Toop, Marion Lozano, Diane Wong, Rowan Nemesto, and Dan Powell. Our audience editor is Nell Gologley. Video production by Chris Schott, Sawyer Rokey, and Pat Gunther.
Starting point is 01:11:03 You can watch this full episode on YouTube at youtube.com slash hardfork. Our executive producer is Jen Poyant. Special thanks to Paula Schumann, Qui Wing Tam, Dalia Haddad, and Jeffrey Miranda. As always, you can email us at hardfork at ny times dot com. And you can email Casey's Hot Tub at hottubton at gmail.com and it will be reading every email

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.