The AI Daily Brief: Artificial Intelligence News and Analysis - Elon Turns On "Most Powerful AI Training Cluster In the World"

Starting point is 00:00:00 Today on the AI Daily Brief, is it too late to start a vertical AI enterprise startup? Before that in the headlines, Elon turns on what he calls the world's biggest AI training supercluster. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Hello, friends, quick note before we dive in, you might notice that the audio feels a little off in the headline section. I realized only after recording that I was recording with the main computer microphone rather than the normal good podcast our microphone. We've done our best to try to fix it up, but bear with me for the three or four minutes of that. And then in the main section, you'll hear the good, nice audio once again.

Starting point is 00:00:43 Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes. While most of the LLM war discussion recently has been around either A, the push to competition on the smaller model front, something that we've talked about recently and even have a story on today, or of course, the question of GPT40 versus Claude 3.2.5.2. Sonnet. Still, there are a handful of lurking competitors who, it feels clear, haven't really fully entered the fray, one of those, of course, being Elon Musk's XAI. Now, Grogh has done some things. We're on version 1.5, version 1 was open sourced, and simply by virtue of the fact that it is an Elon project, meaning that it gets to be integrated into Twitter slash X, as well as access that

Starting point is 00:01:26 data, and in the fact that maybe at some point there'll be some Tesla connection, means that it would be highly inadvisable to write them off. What's more, Elon has been talking for a while about the fact that they were going to invest a huge amount in computing resources, and as of this morning, we got a little bit more information about that. At 5.57 a.m. Eastern Time, Elon tweeted, nice work by XAI team, Nvidia, and supporting companies getting Memphis supercluster training started at 4.20am local time.

Starting point is 00:01:53 With 100,000 liquid-cooled H-100 on a single RDMA fabric, it's the most powerful AI training cluster in the world. This is a significant advantage in training the world's most powerful AI by every metric by December this year. Elon has indicated previously that GROC 3 would be trained on these 100,000 H100 chips. And what's more, there's been a flow of a number of Tesla employees over to XAI, so much that some in the Tesla community have even accused Elon of poaching from himself. This supercluster was a big part of what Elon was fundraising for, that was originally going to be $3 billion raised on a $21 billion valuation, but ballooned at the last minute, to $6 billion on 24. Reports are that between half and two-thirds of that will go to simply buy and

Starting point is 00:02:33 compute. I think for us, one takeaway is that we now have Elon throwing down a clear gauntlet. He wants to have the world's most powerful AI by every metric by December this year. Next up, speaking of NVIDIA, one of the few headwinds facing that company has been U.S. AI chip export restrictions. Throughout the last couple of years, the Biden administration has continuously tightened those rules, forcing companies that sell chips to China to frequently update their offerings to come in under various power thresholds. Still, it seems for now the calculus is that rather than abandoning the Chinese market, it makes more sense to try to customize offerings to be as close to the state of the art as possible within those restrictions, with the latest efforts from Nvidia coming in a chip

Starting point is 00:03:14 they're calling the B20. This is part of the Blackwell chip line that was announced in March, which is slated to go into production later this year. The information writes, it's unclear how NVIDIA designs the B20 to align with U.S. rules while keeping competitive in China, basically referring to the fact that as these restrictions take hold, more and more of the Chinese market is moving to local alternatives where they can. Back to small model competition, Apple has recently shared a new small model that outperforms mistral. Venturebee points out that Apple has released a package of two models in the DCLM or Data Comp for Language Models Project on Hugging Face, one with 7 billion parameters and the other with 1.4 billion.

Starting point is 00:03:52 Enterbeat writes, they both performed pretty well on the benchmarks, especially the bigger one, which has outperformed Mistral 7B and is closing in on other leading open models, including Lama 3 and Gemma. Vaisal Shankar from Apple writes, we have released our DCLM models on Hugging Face. To our knowledge, these are by far the best performing truly open source models, open data, open weight models, open training code. Our 6.9 billion base model is competitive with Mistral Lama 3, Gemma, 2 on most benchmarks, but we also release our entire training set and pre-training recipe for the community to build on top of.

Starting point is 00:04:20 We also have a strong 1.4B version, which significantly outperforms recently released SOTA Small LM models. We additionally release an instruction-tuned variant of these models that exhibit strong performance on IT benchmarks, like Alpacca Bench. As we pointed out last week, the competition for small model terrain is just as vicious and intense in many ways as the competition for AGI in the state of the art. Speaking of Apple and competition, AMD recently has been talking a big game. The Verge writes, AMD says its new laptop chips can beat Apple, but still has to prove it. The Verge writes,

Starting point is 00:04:51 2024 will go down in tech history as the year Microsoft was finally able to make Windows laptops into serious competitors to the MacBook. So far, that's thanks to Qualcomm's new Snapdragon chips, which switched to a homogeneous chip architecture, increased clock speeds, and caught up to Apple's speedy and power-efficient processors. But now, AMD says it has chips

Starting point is 00:05:07 that can take on the MacBook too, and keep the company's processors in the mix. In fact, at a recent event, one of the Verge authors writes, I heard AMD brag about beating the MacBook more than I've ever heard a company directly targeted a competitor before. The general vibe of the piece, however, is prove it.

Starting point is 00:05:22 For now, lots of interesting competition to start the week, but that's going to do it for the AI Daily Brief Headlines edition. Up next to the main episode. Today's episode is brought to you by Super Intelligent, the platform for fun, fast AI learning. Super has a ton of new things going on. We recently announced our partnership with Spotify, through which users of that app can now access Super Intelligent content

Starting point is 00:05:43 directly from their mobile apps. We've also just launched the AI learning feed. In addition to seeing the tutorials that we're dropping, there are polls, news items with related lessons, and a chance for people to show off the projects and use cases that are making AI come alive for them. We've also just kicked off the Super Summer Challenge, where each week we'll share a new challenge that you can use to discover new AI tools and use cases. Go to Bsuper.a.i and use code superfund for 50% off your first two months. That's Bsuper.a.i. Today's episode is brought to you by Venice. The leading AI companies store your

Starting point is 00:06:17 entire conversation history and attach it to your identity forever. Every question you ask, every answer you receive, every image you generate, every thought you share with the machine, it's all being spied on. If you trust all the company's hackers and NSA board members that will ever have access to your AI conversations, then rejoice, for you are well served. For the rest of us, Venice is an alternative. Venice is a powerful AI app for text, image, and cogeneration that respects you as a sovereign individual and believes privacy and free speech are not only human rights, but are necessary for civilizational advancement. private, permissionless, and uncensored. You can try it for free without an account at venice.a.i.

Starting point is 00:06:53 Welcome back to the AI Daily Brief. Over the weekend, I saw an interesting conversation emerge around effectively whether it's too late to start building some sort of vertical AI startup going after a particular industry. I actually think it's a really useful lens by which to understand where we are in the enterprise cycle as relates to AI. And so I want to talk through this a little bit more in depth, bringing into it some of the conversations that we're having over at Super Intelligent. Our main customer base at Superintelligent is individuals and enterprise customers who are thinking about how to use AI in their particular professional verticals. So we have a lot of reps just with this particular conversation. Now, where this started was with this

Starting point is 00:07:36 provocative tweet from at Emily and VC, where she wrote, calling it now, Harvey, the legal AI company, will end up being roadkill on the side of the highway, complete smoke and mirrors company. More precisely, it's a zero. Likes aren't public, but y'all would be floored by the people who are liking this tweet, some of the top founders and investors in the Valley. This is the company she's referring to. Harvey, which frames itself as the trusted legal AI platform. Augment your workflows using domain-specific models trained by and four professional service providers.

Starting point is 00:08:06 In another article by Edward Bustell, some of Harvey's history was given. OpenAI invested in the company the same month that it launched ChatGBT, which, as Biscall says, means that Harvey arguably had a three to six-month head start on virtually all of its competitors in legal technology. The company was literally working with ChatCBT-GPT 3.5 before any other generative AI products in legal. There is no question that Harvey could have come out of the gate with a go-to-market strategy that could have easily captured the imagination of legal early adopters across the spectrum of specializations. Harvey, with its massive first-mover advantage, could have set the standard for legal LLMs while creating the workflows for ethical use and manage the hallucination issue.

Starting point is 00:08:42 Now, the rest of the article is a bit skeptical, but I'm more interested in why, there is skepticism here, then in trying to validate it or not, as I've spent no time with Harvey and have no real sense of how the company is actually doing. Two reference points that I think are interesting. The first is a comparison to Bloomberg GPT. For those of you who don't remember, Bloomberg GPT was a specially trained finance LLM that took advantage of all of Bloomberg's data, and basically Bloomberg had bet that with their specialized data, they'd be able to outperform a generalist model. However, when GPT4 came out, which of course did not have any specialized finance training, it still beat it on basically every finance tasks.

Starting point is 00:09:18 Wrote Professor Ethan Mollick back then, it is part of a pattern. The smartest generalist frontier models beat specialized models and specialized topics. Your special proprietary data may be less useful than you think in the world of LLMs. Again, I don't know if Harvey is experiencing any sort of version of this. They claim that their custom trained models are preferred 97% of the time by clients, But I do think that right now with any verticalized models, this is a really important question to watch. It's one that different people are going to disagree on and will have a huge impact on how vertical industry-oriented LLMs function or not in the future.

Starting point is 00:09:51 The other thing, though, that I want to point out is that Emily followed up her own tweet with a screenshot from a big law partner that uses quote-unquote Harvey from her DMs. That law partner said, Harvey has an insane amount of inactive, quote-unquote, customers that will churn. big law firms that want to tell clients they use AI sign-up and then no one uses it. Now, this is sort of presented in the context of this conversation as a big gotcha, like Harvey is tricking people in some way. And of course, this is just an anonymous source in the DMs. But even if this is accurate, I don't think it necessarily means what it's being presented to mean here, i.e. that Harvey is just a smoke and mirrors company.

Starting point is 00:10:28 There is a huge gap right now. It's the gap that we spend literally all day in every day between the interest and excitement and intention to use AI to get value and the actual capacity to do so. There are a litany of reasons why that gap exists. Yes, one piece of it may be this sort of AI theater that this big law partner is arguing is happening, where big law firms just want to tell clients they use AI, even though they don't. But there are other much less pernicious issues as well. There's a capacity issue in helping people actually learn these tools. And then there's also the difficult, slow process of industries taking the time to figure out what parts of their process these tools

Starting point is 00:11:06 actually helped with. Now, Bruno Cobah took this conversation and brought it up a level from the specifics of Harvey and the legal industry to vertical AI startups in general. Bruno is an MBA candidate at Stanford and writes, I've been investigating vertical AI startups profoundly for the past few weeks. I think we're in a very strange part of the cycle in AI startup funding and development. One, literally every vertical, finance law, health care, et cetera, is now populated with the AI startups building on top of foundation models. If you start today, it's too late to get first mover advantage. Two, however, foundation models are still not quite there yet to solve problems in those verticals in a tangible bulletproof way. Those who build GPT wrappers, even if they

Starting point is 00:11:43 refuse to admit the label, rely on OpenAI's next big model to truly prove sticky long-term retention. And those who decide to train their own specialized models like Harvey are apparently not outperforming smart general models GPT40 Claude 3.5 Sonet. Three, startups are facing a massive dependence on big AI lab shipping the next big model. And once that happens, it might just be that we won't need GPP wrappers at all. ChatGPT will be enough for whatever task we hand them. Four, this is fundamentally so different from the internet and mobile innovation cycles. Back then, incumbents could not touch every single vertical with their product offerings, so startups filled the void and became multi-billion dollar companies, e.g. Apple launched the app store, but wouldn't build apps for food delivery,

Starting point is 00:12:21 dating, ride sharing, et cetera. Now, when GPT5 launches, OpenAI can touch virtually every industry they want. 5. My sentiment is that we're living in a kind of born too late to explore the Earth, born too early to explore space vibe, in this 23-24 cycle in the realm of AI startups. And when the rocket ships to space are finally ready, i.e. truly smart general LLMs, vertical AI startups will capture much less value versus incumbents when compared to previous cycles. I think here it's worth narrowing the parameters of the conversation or at least breaking it apart between, on the one hand, the question of AI startups in general. And on the other, the specifics of verticalized LLMs.

Starting point is 00:12:57 As I mentioned before, there is this big question around whether specially trained or fine-tuned vertical models can ever actually out-compete the generalist state of the art, and Bruno is absolutely right that the answer to that question will have a significant impact on how AI rolls out across industries. Gary Tan, the president and CEO at Y Combinator, however, isn't so sure. He writes, I don't think it's too late to enter almost any software market with an LLM-powered alternative if you want to. We are seeing tiny teams of a few people build valuable software leapfrogging incumbents with even

Starting point is 00:13:28 today's frontier models. The war will be retention. Who can serve your vertical better? I responded to Gary's post, which was, of course, the inspiration for this full episode, and said we talked to hundreds of enterprise customers for these startups every week. There's no universe in which the game is one right now. In fact, if anything, enterprises are getting more sophisticated about wanting a combination of powerful NFAI with real product UI and U.S.

Starting point is 00:13:49 And that's why I wanted to separate the intrinsic question inside Bruno's tweet around the ultimate role for custom-trained verticalized LLM models from the question more broadly of whether the big labs are just going to eat all of the enterprise business. The era that we are in right now is a use case exploration era. It is an experimentation era that is happening both vertically inside the organization, i.e. these big custom models that take advantage of all their data, but it's also happening horizontally, where, for example, individuals across those law firms are experimenting and figuring out which use cases actually save them time and make things easier or better. Part of the opportunity, I believe, is for companies to come in and actually design

Starting point is 00:14:29 product experiences around LLMs that interact positively with those high-value use cases as they get discovered. In other words, I tend to think that for products to be sticky inside these companies, they're going to have to actually be products. They can't just be chat GPT, but for legal. Now, one thing that I will also note, where Bruno writes, this is fundamentally so different from the internet and mobile, back that incumbents could not touch every single vertical with their product offerings. Sort of. But social really challenges that idea. When I moved to San Francisco in 2008, it was specifically to help change.org compete in the social network for social good space. It was a time when four years after Facebook had been launched, pretty much everyone was convinced

Starting point is 00:15:10 that there was going to be a Facebook for everything. There was going to be a Facebook for change. There was going to be a Facebook for, you name it. Pick your industry vertical. There was going to be a Facebook for that. There were so many social networks for social change at that time that there was even an aggregator platform called social actions, which took inputs from all 40 of those platforms and allowed you to do it from a single spot. Now, of course, at the end of the day, there wasn't a place for a social network for social change. The social network for social change, just like the social network for everything else, was just meta. The things that came later, that were also social networks, had fundamentally different type of content experiences at their

Starting point is 00:15:47 core, Instagram with mobile and photos, Snapchat with mobile and disappearing messages, et cetera, et cetera, et cetera. And yet still, the ecosystem around all of those social channels is huge. So I'm not sure. What I do know is that for aspiring entrepreneurs out there in the AI space interested in particular verticals, our read from all our conversations is that there is still a ton of space and that the only thing that is clear at this stage is that enterprises want partners who can actually help them take advantage of AI to drive. real value. If you think you have a good answer for that, I tend to think that there's more of a market than many might think. For now, though, that is going to do it for today's AI Daily Brief.

Starting point is 00:16:26 Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - Elon Turns On "Most Powerful AI Training Cluster In the World"

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.