The AI Daily Brief: Artificial Intelligence News and Analysis - Elon Turns On "Most Powerful AI Training Cluster In the World"
Episode Date: July 22, 2024At 4:20am, Elon Musk turned on the Memphis Supercluster to begin training what he claims will be the world's most powerful AI by December. Also NLW explores a question: is it too late to start AI ...startups (or at least vertical LLMs)? Concerned about being spied on? Tired of censored responses? AI Daily Brief listeners receive a 20% discount on Venice Pro. Visit https://venice.ai/nlw and enter the discount code NLWDAILYBRIEF. Learn how to use AI with the world's biggest library of fun and useful tutorials: https://besuper.ai/ Use code 'podcast' for 50% off your first month. The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
Transcript
Discussion (0)
Today on the AI Daily Brief, is it too late to start a vertical AI enterprise startup?
Before that in the headlines, Elon turns on what he calls the world's biggest AI training supercluster.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
To join the conversation, follow the Discord link in our show notes.
Hello, friends, quick note before we dive in, you might notice that the audio feels a little off in the headline section.
I realized only after recording that I was recording with the main computer microphone rather than the normal good podcast
our microphone. We've done our best to try to fix it up, but bear with me for the three or four
minutes of that. And then in the main section, you'll hear the good, nice audio once again.
Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around
five minutes. While most of the LLM war discussion recently has been around either A, the push
to competition on the smaller model front, something that we've talked about recently and even
have a story on today, or of course, the question of GPT40 versus Claude 3.2.5.2.
Sonnet. Still, there are a handful of lurking competitors who, it feels clear, haven't really
fully entered the fray, one of those, of course, being Elon Musk's XAI. Now, Grogh has done some things.
We're on version 1.5, version 1 was open sourced, and simply by virtue of the fact that it is an
Elon project, meaning that it gets to be integrated into Twitter slash X, as well as access that
data, and in the fact that maybe at some point there'll be some Tesla connection,
means that it would be highly inadvisable to write them off.
What's more, Elon has been talking for a while about the fact that they were going to invest
a huge amount in computing resources, and as of this morning, we got a little bit more information
about that.
At 5.57 a.m. Eastern Time, Elon tweeted,
nice work by XAI team, Nvidia, and supporting companies getting Memphis supercluster training
started at 4.20am local time.
With 100,000 liquid-cooled H-100 on a single RDMA fabric, it's the most powerful AI
training cluster in the world. This is a significant advantage in training the world's most powerful
AI by every metric by December this year. Elon has indicated previously that GROC 3 would be trained
on these 100,000 H100 chips. And what's more, there's been a flow of a number of Tesla employees over to
XAI, so much that some in the Tesla community have even accused Elon of poaching from himself.
This supercluster was a big part of what Elon was fundraising for, that was originally going to be
$3 billion raised on a $21 billion valuation, but ballooned at the last minute,
to $6 billion on 24. Reports are that between half and two-thirds of that will go to simply buy and
compute. I think for us, one takeaway is that we now have Elon throwing down a clear gauntlet. He wants
to have the world's most powerful AI by every metric by December this year. Next up, speaking of
NVIDIA, one of the few headwinds facing that company has been U.S. AI chip export restrictions.
Throughout the last couple of years, the Biden administration has continuously tightened those rules,
forcing companies that sell chips to China to frequently update their offerings to come in under various
power thresholds. Still, it seems for now the calculus is that rather than abandoning the Chinese
market, it makes more sense to try to customize offerings to be as close to the state of the
art as possible within those restrictions, with the latest efforts from Nvidia coming in a chip
they're calling the B20. This is part of the Blackwell chip line that was announced in March,
which is slated to go into production later this year. The information writes,
it's unclear how NVIDIA designs the B20 to align with U.S. rules while keeping competitive in China,
basically referring to the fact that as these restrictions take hold, more and more of the Chinese
market is moving to local alternatives where they can.
Back to small model competition, Apple has recently shared a new small model that outperforms
mistral. Venturebee points out that Apple has released a package of two models in the DCLM or
Data Comp for Language Models Project on Hugging Face, one with 7 billion parameters and the other with 1.4 billion.
Enterbeat writes, they both performed pretty well on the benchmarks, especially the bigger one,
which has outperformed Mistral 7B and is closing in on other leading open models, including
Lama 3 and Gemma.
Vaisal Shankar from Apple writes, we have released our DCLM models on Hugging Face.
To our knowledge, these are by far the best performing truly open source models, open data,
open weight models, open training code.
Our 6.9 billion base model is competitive with Mistral Lama 3, Gemma, 2 on most benchmarks,
but we also release our entire training set and pre-training recipe for the community to build on top of.
We also have a strong 1.4B version, which significantly outperforms recently released
SOTA Small LM models. We additionally release an instruction-tuned variant of these models that
exhibit strong performance on IT benchmarks, like Alpacca Bench.
As we pointed out last week, the competition for small model terrain is just as vicious
and intense in many ways as the competition for AGI in the state of the art.
Speaking of Apple and competition, AMD recently has been talking a big game.
The Verge writes, AMD says its new laptop chips can beat Apple, but still has to prove it.
The Verge writes,
2024 will go down in tech history
as the year Microsoft was finally able to make Windows laptops
into serious competitors to the MacBook.
So far, that's thanks to Qualcomm's new Snapdragon chips,
which switched to a homogeneous chip architecture,
increased clock speeds,
and caught up to Apple's speedy and power-efficient processors.
But now, AMD says it has chips
that can take on the MacBook too,
and keep the company's processors in the mix.
In fact, at a recent event,
one of the Verge authors writes,
I heard AMD brag about beating the MacBook
more than I've ever heard a company
directly targeted a competitor before.
The general vibe of the piece, however, is prove it.
For now, lots of interesting competition to start the week,
but that's going to do it for the AI Daily Brief Headlines edition.
Up next to the main episode.
Today's episode is brought to you by Super Intelligent,
the platform for fun, fast AI learning.
Super has a ton of new things going on.
We recently announced our partnership with Spotify,
through which users of that app can now access Super Intelligent content
directly from their mobile apps.
We've also just launched the AI learning feed.
In addition to seeing the tutorials that we're dropping, there are polls, news items with related
lessons, and a chance for people to show off the projects and use cases that are making AI come
alive for them. We've also just kicked off the Super Summer Challenge, where each week we'll share
a new challenge that you can use to discover new AI tools and use cases. Go to Bsuper.a.i and use
code superfund for 50% off your first two months. That's Bsuper.a.i.
Today's episode is brought to you by Venice. The leading AI companies store your
entire conversation history and attach it to your identity forever. Every question you ask,
every answer you receive, every image you generate, every thought you share with the machine,
it's all being spied on. If you trust all the company's hackers and NSA board members that
will ever have access to your AI conversations, then rejoice, for you are well served. For the rest
of us, Venice is an alternative. Venice is a powerful AI app for text, image, and cogeneration
that respects you as a sovereign individual and believes privacy and free speech are not only
human rights, but are necessary for civilizational advancement.
private, permissionless, and uncensored. You can try it for free without an account at venice.a.i.
Welcome back to the AI Daily Brief. Over the weekend, I saw an interesting conversation emerge
around effectively whether it's too late to start building some sort of vertical AI startup
going after a particular industry. I actually think it's a really useful lens by which to
understand where we are in the enterprise cycle as relates to AI. And so I want to talk through
this a little bit more in depth, bringing into it some of the conversations that we're having
over at Super Intelligent. Our main customer base at Superintelligent is individuals and enterprise
customers who are thinking about how to use AI in their particular professional verticals.
So we have a lot of reps just with this particular conversation. Now, where this started was with this
provocative tweet from at Emily and VC, where she wrote, calling it now, Harvey, the legal AI company,
will end up being roadkill on the side of the highway, complete smoke and mirrors company.
More precisely, it's a zero.
Likes aren't public, but y'all would be floored by the people who are liking this tweet,
some of the top founders and investors in the Valley.
This is the company she's referring to.
Harvey, which frames itself as the trusted legal AI platform.
Augment your workflows using domain-specific models trained by and four professional service providers.
In another article by Edward Bustell, some of Harvey's history was given.
OpenAI invested in the company the same month that it launched ChatGBT, which, as Biscall says,
means that Harvey arguably had a three to six-month head start on virtually all of its competitors
in legal technology. The company was literally working with ChatCBT-GPT 3.5 before any other generative
AI products in legal. There is no question that Harvey could have come out of the gate with a
go-to-market strategy that could have easily captured the imagination of legal early adopters
across the spectrum of specializations. Harvey, with its massive first-mover advantage, could have set
the standard for legal LLMs while creating the workflows for ethical use and manage the hallucination issue.
Now, the rest of the article is a bit skeptical, but I'm more interested in why,
there is skepticism here, then in trying to validate it or not, as I've spent no time with Harvey
and have no real sense of how the company is actually doing. Two reference points that I think are
interesting. The first is a comparison to Bloomberg GPT. For those of you who don't remember, Bloomberg
GPT was a specially trained finance LLM that took advantage of all of Bloomberg's data, and basically
Bloomberg had bet that with their specialized data, they'd be able to outperform a generalist model.
However, when GPT4 came out, which of course did not have any specialized finance training,
it still beat it on basically every finance tasks.
Wrote Professor Ethan Mollick back then,
it is part of a pattern.
The smartest generalist frontier models beat specialized models and specialized topics.
Your special proprietary data may be less useful than you think in the world of LLMs.
Again, I don't know if Harvey is experiencing any sort of version of this.
They claim that their custom trained models are preferred 97% of the time by clients,
But I do think that right now with any verticalized models, this is a really important question to watch.
It's one that different people are going to disagree on and will have a huge impact on how vertical industry-oriented LLMs function or not in the future.
The other thing, though, that I want to point out is that Emily followed up her own tweet with a screenshot from a big law partner that uses quote-unquote Harvey from her DMs.
That law partner said, Harvey has an insane amount of inactive, quote-unquote, customers that will churn.
big law firms that want to tell clients they use AI sign-up and then no one uses it.
Now, this is sort of presented in the context of this conversation as a big gotcha,
like Harvey is tricking people in some way.
And of course, this is just an anonymous source in the DMs.
But even if this is accurate, I don't think it necessarily means what it's being presented
to mean here, i.e. that Harvey is just a smoke and mirrors company.
There is a huge gap right now.
It's the gap that we spend literally all day in every day between the interest and
excitement and intention to use AI to get value and the actual capacity to do so. There are a litany
of reasons why that gap exists. Yes, one piece of it may be this sort of AI theater that this big
law partner is arguing is happening, where big law firms just want to tell clients they use AI,
even though they don't. But there are other much less pernicious issues as well. There's a capacity
issue in helping people actually learn these tools. And then there's also the difficult,
slow process of industries taking the time to figure out what parts of their process these tools
actually helped with. Now, Bruno Cobah took this conversation and brought it up a level from
the specifics of Harvey and the legal industry to vertical AI startups in general. Bruno is an MBA
candidate at Stanford and writes, I've been investigating vertical AI startups profoundly for the
past few weeks. I think we're in a very strange part of the cycle in AI startup funding and
development. One, literally every vertical, finance law, health care, et cetera, is now populated with
the AI startups building on top of foundation models. If you start today, it's too late to get
first mover advantage. Two, however, foundation models are still not quite there yet to solve
problems in those verticals in a tangible bulletproof way. Those who build GPT wrappers, even if they
refuse to admit the label, rely on OpenAI's next big model to truly prove sticky long-term retention.
And those who decide to train their own specialized models like Harvey are apparently not outperforming
smart general models GPT40 Claude 3.5 Sonet. Three, startups are facing a massive dependence on
big AI lab shipping the next big model. And once that happens, it might just be that we won't need
GPP wrappers at all. ChatGPT will be enough for whatever task we hand them. Four, this is fundamentally
so different from the internet and mobile innovation cycles. Back then, incumbents could not touch
every single vertical with their product offerings, so startups filled the void and became multi-billion
dollar companies, e.g. Apple launched the app store, but wouldn't build apps for food delivery,
dating, ride sharing, et cetera. Now, when GPT5 launches, OpenAI can touch virtually every industry they want.
5. My sentiment is that we're living in a kind of born too late to explore the Earth,
born too early to explore space vibe, in this 23-24 cycle in the realm of AI startups.
And when the rocket ships to space are finally ready, i.e. truly smart general LLMs,
vertical AI startups will capture much less value versus incumbents when compared to previous cycles.
I think here it's worth narrowing the parameters of the conversation or at least breaking it
apart between, on the one hand, the question of AI startups in general. And on the other,
the specifics of verticalized LLMs.
As I mentioned before, there is this big question around whether
specially trained or fine-tuned vertical models can ever actually out-compete the
generalist state of the art, and Bruno is absolutely right that the answer to that question
will have a significant impact on how AI rolls out across industries.
Gary Tan, the president and CEO at Y Combinator, however, isn't so sure.
He writes, I don't think it's too late to enter almost any software market with an LLM-powered
alternative if you want to.
We are seeing tiny teams of a few people build valuable software leapfrogging incumbents with even
today's frontier models.
The war will be retention.
Who can serve your vertical better?
I responded to Gary's post, which was, of course, the inspiration for this full episode,
and said we talked to hundreds of enterprise customers for these startups every week.
There's no universe in which the game is one right now.
In fact, if anything, enterprises are getting more sophisticated about wanting a combination
of powerful NFAI with real product UI and U.S.
And that's why I wanted to separate the intrinsic question inside Bruno's tweet around
the ultimate role for custom-trained verticalized LLM models from the question more broadly of
whether the big labs are just going to eat all of the enterprise business. The era that we are in right
now is a use case exploration era. It is an experimentation era that is happening both vertically
inside the organization, i.e. these big custom models that take advantage of all their data,
but it's also happening horizontally, where, for example, individuals across those law firms
are experimenting and figuring out which use cases actually save them time and make things easier or
better. Part of the opportunity, I believe, is for companies to come in and actually design
product experiences around LLMs that interact positively with those high-value use cases as they get
discovered. In other words, I tend to think that for products to be sticky inside these companies,
they're going to have to actually be products. They can't just be chat GPT, but for legal.
Now, one thing that I will also note, where Bruno writes, this is fundamentally so different from
the internet and mobile, back that incumbents could not touch every single vertical with their product
offerings. Sort of. But social really challenges that idea. When I moved to San Francisco in 2008,
it was specifically to help change.org compete in the social network for social good space.
It was a time when four years after Facebook had been launched, pretty much everyone was convinced
that there was going to be a Facebook for everything. There was going to be a Facebook for
change. There was going to be a Facebook for, you name it. Pick your industry vertical. There was
going to be a Facebook for that. There were so many social networks for social change at that time
that there was even an aggregator platform called social actions, which took inputs from all 40
of those platforms and allowed you to do it from a single spot. Now, of course, at the end of the day,
there wasn't a place for a social network for social change. The social network for social change,
just like the social network for everything else, was just meta. The things that came later,
that were also social networks, had fundamentally different type of content experiences at their
core, Instagram with mobile and photos, Snapchat with mobile and disappearing messages,
et cetera, et cetera, et cetera. And yet still, the ecosystem around all of those social channels
is huge. So I'm not sure. What I do know is that for aspiring entrepreneurs out there in
the AI space interested in particular verticals, our read from all our conversations is that
there is still a ton of space and that the only thing that is clear at this stage is that
enterprises want partners who can actually help them take advantage of AI to drive.
real value. If you think you have a good answer for that, I tend to think that there's more of a
market than many might think. For now, though, that is going to do it for today's AI Daily Brief.
Until next time, peace.
