This Week in Startups - Bill Gurley and Sunny Madra talk open-source vs. proprietary AI | E1825
Episode Date: October 10, 2023This Week in Startups is brought to you by… LinkedIn Marketing. To redeem a $100 LinkedIn ad credit and launch your first campaign, go to linkedin.com/thisweekinstartups Vanta. Compliance and securi...ty shouldn't be a deal-breaker for startups to win new business. Vanta makes it easy for companies to get a SOC 2 report fast. TWiST listeners can get $1,000 off for a limited time at vanta.com/twist CLA. Innovation takes balance. CLA's CPAs, consultants, and wealth advisors can help you get from startup to where you want to end up. Get started now at CLAconnect.com/tech * Today’s show: Bill Gurley and Sunny Madra join Jason to discuss open-source AI vs. proprietary AI (1:12), friends and foes of open-source AI (5:42), and strategic choices companies make between open and closed approaches (9:48). Then, Sunny demos more AI tools (22:45), and much more! * Time stamps: (0:00) Sunny Madra joins Jason (1:12) Bill Gurley joins to break down open-source vs. proprietary AI (5:42) Friends and foes of open-source AI (8:17) LinkedIn Marketing - Get a $100 LinkedIn ad credit at https://linkedin.com/thisweekinstartups (9:48) Strategic choices companies make between open and closed approaches (21:37) Vanta - Get $1000 off your SOC 2 at https://vanta.com/twist (22:45) Sunny demos LLaVA: Large Language and Vision Assistant (29:56) CLA - Get started with CLA's CPAs, consultants, and wealth advisors now at https://claconnect.com/tech (31:00) Sunny breaks down parameter models (38:56) Companies manufacturing their own chips (42:26) Sunny demos Mistral AI (46:56) Jason and Sunny demo Meta AI on WhatsApp (50:44) FTX and the SBF trial * Follow Bill: https://twitter.com/bgurley Follow Sunny: https://twitter.com/sundeep * Check out LLaVA: https://llava-vl.github.io/ Check out Mistral: https://huggingface.co/spaces/Open-Orca/Mistral-7B-OpenOrca?utm_source=substack&utm_medium=email * Read LAUNCH Fund 4 Deal Memo: https://www.launch.co/four Apply for Funding: https://www.launch.co/apply Buy ANGEL: https://www.angelthebook.com Great recent interviews: Steve Huffman, Brian Chesky, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarland, PrayingForExits, Jenny Lefcourt Check out Jason’s suite of newsletters: https://substack.com/@calacanis * Follow Jason: Twitter: https://twitter.com/jason Instagram: https://www.instagram.com/jason LinkedIn: https://www.linkedin.com/in/jasoncalacanis * Follow TWiST: Substack: https://twistartups.substack.com Twitter: https://twitter.com/TWiStartups YouTube: https://www.youtube.com/thisweekin * Subscribe to the Founder University Podcast: https://www.founder.university/podcast
Transcript
Discussion (0)
You don't want any one party controlling a platform technology.
You want it to be open source.
You want closed source solutions, private company solutions, open source solutions.
You want a range of opportunities.
And remember, the last time we had some founders say,
trust me, I'll get us some regulation.
That was SBF.
And he was going to be the one who got us federal regulation for crypto.
And he's on trial.
That's a very moment for a bunch of Mushuganah.
This Week in Startups is brought to you by LinkedIn Marketing.
To redeem a $100 LinkedIn ad credit and launch your first campaign,
go to LinkedIn.com slash this week in startups.
Vanta.
Compliance and security shouldn't be a deal breaker for startups to win new business.
Vanta makes it easy for companies to get a SOC2 report fast.
Twist listeners can get $1,000 off for a limited time at Vanta.com slash
twist and
CLA.
Innovation takes balance.
CLA's CPAs, consultants,
and wealth advisors can help
you get from startup to where you want
to end up. All right, everybody,
welcome back to this weekend startups. It is
Monday. It is Madra
Monday. Yes, that's right. Every Monday,
Money Madra joins us,
Sunny Madra of
definitive AI and we do our
AI demos. And
this week,
Sonny, you told me you were having some deep weekend discussions.
We know Silicon Valley and the tech industry on the weekends.
That's when everybody goes deep.
They don't have meetings, but the back channels start to light up.
And the back channel was lighting up this weekend, talking about what topic?
Well, open source, right?
There was a couple of big threads that kicked up this weekend, and, you know, the tech
start flying, and then I think, you know, we have a special opportunity to have a guest
with us today.
Oh, okay.
Who do we ask?
Yeah.
We've got the legendary, the goat, Bill Gurley.
All right.
Bill Gurley, I think maybe third time.
No, only second time on this weekend startups.
You were last on 2017, episode 722 for folks.
Yeah.
So welcome back.
Bill Gurley, of course, from Benchmark and many great companies.
You're particularly passionate about open source.
why in relation to AI, Bill?
Well, I mean, obviously, there's a lot of answers to that question, but most recently,
when I did the regulatory capture speech at your conference, I was mentioning at the very
end that oddly, some of the, what you might call the early incumbents in AI software,
we're running out and promoting the idea that open source should somehow be curtailed or
knee-capped. In particular, it's quite notable that all of the loud voices are either
executives at these companies and or large investors at these companies. And because of the way
these companies have raised money, the largeness of those investments are in the tens and
hundreds of millions of dollars, so quite a bit at stake. And there now seems to be a
growing populace, and this is what led to the conversation. So we're having.
that are worried that these companies are basically, you know, at the very start of this AI movement,
trying to cut off their biggest competition whatsoever.
And the thing that would probably unleash the most innovation and the most prosperity,
which would be if open source models were prevalent.
And this is more Sunny's world than me, so I'll let him comment.
But I do believe that the stats about the performance,
of these open source models is actually quite compelling.
And what I'm hearing internally from our portfolio is that specifically the Lama too,
but Sonny sent me another one today, are really starting to gain market share amongst
the startups that are using these tools.
So it would be really unfortunate if they found.
And when I spoke at your conference, I mentioned, you know, this guy, George Stigler,
who had won a Nobel Prize in Academic Mission, who had talked about how companies use regular
regulatory capture. And he said the two things they'd try and do is stork competition and protect pricing.
Clearly, if they were successful in getting governments to block open source AI, they would achieve
both of those goals. Yeah. And so that great talk, by the way, if you haven't seen it,
just do a Google search for All in Summit, Bill Gurley, and the name of the talk, 2,851 miles about
regulatory capture. So suddenly, when we look at the space, the open source space, let's get to, you know,
maybe the tail of the tape, some metrics here on how these open source projects are faring against
the large language models that are available by API call, right? And I guess,
paradoxically, open AI, which is now the most closed of all the AI companies, but has open in the name
and started under the mandate that this technology was too powerful for it to be closed. It needed to be
open to everybody. Has gone exactly closed and has gone exactly regulatory capture.
with Sam Altman
begging, literally,
begging for the government
to get involved and to regulate.
So maybe we could talk a little bit about,
you know, the, what are the leading
open source AI projects?
So let's just kind of level set, like, you know,
who the players are because, you know, to Bill's point,
there's many different people that are showing up.
And also this is kind of a good lay of the land
because you see here some startups,
some real incumbents and, you know,
some new folks had really kind of risen to the top.
And so just level setting, both Anthropic and Open AI, you know, I'd say kind of leading
models in the space, definitely closed.
Microsoft is kind of in this unique straddle.
They're open sourcing lots of unique pieces of content, additional frameworks that
you need around, like LLMs, but they're gone all in with Open AI.
Google started this whole thing.
Obviously, the reason others can get there is the papers or OpenSor.
and people have written about them,
but we haven't seen an open model from Google in a while,
although they do support open models in their vertex platform.
And then meta and then Databricks,
and there's another one we'll bring up Mistral as well today.
These folks are fully open.
Commercial use case is different.
Commercial use differs based on their licenses,
but in general they're sharing everything,
and we're seeing really fast innovation.
So that's the level set.
And by the way, just two quick things to add to what Sunny just said.
It's remarkably ironic that OpenAI, when Elon backed it, was open source and moved away from it.
And then second, as Sunny mentioned, I've been reading a ton this weekend to try and get a lot deeper in this world.
And everyone points to this Google paper attention is all you need as the thing that allowed these LLMs to become successful and to really progress.
So you do have an academic paper with all Google people on it that actually led to the technology that's being used by anthropic and open AI.
So I think that's ironic also.
Anyway, sorry to interrupt.
No, no, no.
And going deeper into the playing field, what's your analysis of what's happening here?
What I hear often people say is when you're in the lead, you're closed, and when you're behind, you go open.
So iOS was in the lead and they kept it very closed.
Google search, their algorithm, they're in the lead.
They have a dominant position.
They'll never open up the search algorithm.
But then you have somebody like maybe meta, they feel like they're pretty far behind on language models.
So they open source it.
And the report was they internally leaked it accidentally on purpose.
All right.
Listen, when you're selling to business to business buyers, you really want to get your pitch in front of decision makers.
Why? Because upper level execs are usually the ones making purchasing decisions. Duh. The problem is,
high level folks can be really hard to find and target on most social media platforms. But on LinkedIn,
oh my gun, they know all of the CTOs, all of the CFOs, all of the VPs of finance, engineering, HR, recruiting.
All those titles are sitting there waiting for you. And now let's just talk about the funnel.
LinkedIn's about to hit a billion members. Did you know that? 950 million members.
at this point in time. There are 180 million of those 950 who are senior level execs.
There are 10 million C-level executives in that 180 million senior level execs, which are
part of the 950 million members. I am a C-level executive. I am on LinkedIn all day long because
LinkedIn equals business, business equals LinkedIn and LinkedIn ads are built specifically for
B-to-B marketers. LinkedIn generates two to five times higher return on ad spend than other social
media platforms. LinkedIn equals business, business equals LinkedIn. When people are on LinkedIn,
they're ready to do business. It's that simple. So make business to business marketing, everything
it can be. And get a $100 credit on your next campaign from me, your boy, J-Cal. I'm sending you
the hundy, LinkedIn.com slash this week and startups to claim your credit. That's LinkedIn.com
slash this weekend startups. Terms and conditions apply because they're giving you the hundy.
So maybe some historical information here on when do companies,
choose open versus closed. And to your point, Google was behind in cloud services and led a movement
to open source Kubernetes. So there's a company that has played both sides of the aisle,
depending on where they are. When you look at this field, Open AI and Anthropic, these are
startups. Have you ever seen startups be asked for regulation this early and this often and be so
opposed to open source. Is this a new trend? And what do you attribute it to? I mean, Bill, like,
there's more your area. You answer that one and then I can chime. Yeah. Well, it's funny because
I have certainly never seen it, Jason. I've never seen this early. And maybe it just speaks to the
wild success of open source. I mean, the number of venture backed open source companies today
versus 30 years ago is just amazing. It is a very disruptive way.
to get your technology out there quick and fast.
And it's, you know, one of the other things that Stigler talked about, a phrase I mentioned
several times, he said, when you have this type of blocking regulation, you end up with
a net loss to society.
And I'm one that firmly believes that open source is amazing for society because when you have
technology locked up with patents, it's harder for ideas to spread.
it's harder for ideas to spread across borders and to other countries among all these different
smart people that can get out and innovate.
And so I have never seen something this early.
Obviously, there's a ton at stake.
There's a ton, like I mentioned, these companies have raised money at an unprecedented level.
So, yeah, you could call them early state startups, but you could also, I mean, they've raised
billions of dollars each, you know, and only maybe in the ride-sharing market did you have that
happened so quickly.
And so there's a lot at stake.
There's clearly a lot of stake, but I've never seen startups proactively pitch governments
and not just ours, but several governments around the world.
I've never seen that.
I also find it really suspect that there aren't like technologically, like academicians out there
leading this charge, the people that are leading the charge calling for the regulation
and calling, and some of them raising this question of whether open source should be allowed,
are the incumbents.
They are the one, either the incumbents are their backers.
And some of them, you know, Mustafa from Infliction on a podcast said, I know it looks odd,
me being the one that's asking.
And yeah, it does.
It is odd.
It is odd.
And it would, it reeks of protectionism pulling up the ladder behind you.
And with my.
Microsoft and Google, they seem to be maybe trying to dance along the line here, Bill.
They want to have cloud computing.
They both have major cloud computing services, Azure, Google Cloud.
They want to offer these things, but they also have, you know, a proprietary use for this.
Obviously, Google Search and Bard and the chat interface are going to overlap.
Microsoft trying to get Bing to break out using AI in the office suite.
So they see that as a competitive energy.
What's your, how would you handicap Google and Microsoft's behavior?
Yeah.
Well, I mean, I think to a certain extent, it was interesting on the chart that, I don't know who produced the chart that Sonny put up, but it had Microsoft neutral.
I could find myself believing that.
I mean, they had to do a rather convoluted deal to get access to the technology that they're, you know, using with Open AI.
and I wouldn't be shocked if they're comfortable with a hedge on that,
primarily because they already control these creative products
that they now believe will be enhanced with AI.
And I don't know that whether it's open source AI
or someone else's AI, that it really impacts either of them
because it's the lock-in they have on the products.
So I wouldn't be surprised by that.
And like I mentioned, Google's played this both ways.
So, you know, they pseudo-opensourced Android because within open-source, there's different dimensions on how open it is and whether you've really committed to a third party like the Linux Foundation that runs the regulatory aspect of it.
Or not regulatory, because I don't want to confuse it with the government.
That runs, you know, how it's, how many, the most open projects have a third party that keeps it independent.
Open Source Foundation.
Yeah.
and afford.
There's other than the Open Source Foundation,
but that's the largest one that manage the process.
And Android's not like that.
So anyway,
I had to do that quick aside,
but Google's done.
Whereas Kubernetes,
wide open Lennox Foundation manages it.
And so they've been all over the map.
And as you mentioned,
they once posed a paper.
We love open source,
but it's not right for search.
You know?
Yeah. Well, once you get that lock in,
that's when you don't want to open source it,
because people can then build competitive products.
And as we've seen in search, that space has not seen any changes in 20 years.
Like that has been a locked, you know, box where nobody's innovated for 20 years ago.
A number of people have tried.
I tried myself.
It's very hard to get any kind of a foothold in search.
But I don't think, you know, those two aren't at the forefront of this.
As you mentioned, it seems to be a battle between these extremely well-funded startups and really more of a community.
I'd say the people that are on the other side of this, based on what I saw going around this weekend, it's more of a community.
I mean, it involves, you know, like Jim Zimlin, who runs the Linux Foundation, who's been out talking.
At your conference, I spoke to Stephen Wolfram, who told me he thought it was ridiculous that someone would try and ban open source here on a safety reason.
And so that's what I say when, if I might be more open to listening if I thought it were, you know, some broad group.
of technologists and big thinkers that were making this argument. But all of the arguments
are coming from the people with the most to lose. Yeah. And that seems crazy. Senni, you want to
just give us an idea of how... Well, yeah, I mean, you know, we're going to lose Bill in a
couple of minutes here, but like one thing I'd love to get his thoughts on because I think it really
reinforces the point here is that, so, you know, last week we saw big funding announcer on a mistrol.
It's like a European-based group that, you know, raised $100 plus million to Bill's point.
money. And, you know, they release their model, open source, open for commercial use as well.
And you can see a couple of key points here. One, you can see that there's seven billion
parameter model uses, again, half the amount of memory. And down here, even just against other
open source models, right, there are seven billion parameter model is outperforming, you know,
Lama 213 billion. Again, this is just against open source, but like the rate of innovation is moving
so quickly here that if some of that regulation were to come into play and this,
these folks couldn't put this out there and they had to do it, you know, through some kind
of regulator, which has been, you know, like the FDA or some of that. Those are the ideas
that we've heard put out there. I think we just wouldn't see that. And I'll just add one more
thing. Like last week, JCal, you know, we even demo GPTV. And, you know, a week later, we get
lava and lava as an open source implementation of like a vision model and the same example that we did
there, you know, we have it. And I think, you know, to Bill's point, I can't get my head around,
you know, what it is around open source that's bothering people. We've seen open source in
operating systems. We've seen it in databases, right? We've seen it in mobile phone operating
systems. You know, you get things get patched quicker. When the code is open, you can find
vulnerabilities. And so the arguments that around safety, no one is providing the sort of the
background as to, you know, what is it that is not safe here. It's fairly obvious what's
happening here, Bill. The people who have the lead are using job destruction and the fear of
AI from science fiction and this could get out of control. The demon could be unleashed. They're using
that in order to maintain their lead because they know full well that large numbers of startups or
mid-sized companies embracing an open source project would lead to the demise and would absolutely
evaporate the lead of Open AI.
And this really is about Open AI and Sam Waltman.
Let's call it what it is. Sam is the one who's leading the charge.
Although Mustafa and Reid Hoffman have been perhaps even more vocal, or at least openly vocal
on podcast and whatnot.
So it's not just, but once again, hundreds of millions, billions of state.
Yeah, I mean, I agree with Sonny.
I mean, it's ironic, but Linux is the most stable, most secure operating system that's ever
existed. And I think, you know, I go back to some of the original thesis of why open source
would work and more eyes the better. The more transparency, the better. And the notion that the
people, that it's just so ridiculous for someone to say, this stuff's super scary. Like,
you should be really afraid, but let me do it. You know, I'm, I'm, I'm, you should trust me. But it.
But it's super scary.
But it can do really good things, but it's scary.
Let me take care of it.
Help me be the only one that gets to take care of it.
And yeah, that's sad.
I hope there's quite a few people stepping up.
I hope this doesn't happen.
One last thing I would mention before I have to go.
I think the cat's out of the bag.
So you're not going to stop there from being open source in parts of the globe.
So if you shut it down in a particular region,
that region is going to fail to innovate relative to the other regions that are out there.
There's a really cool piece of open source technology called Risk 5, which you guys have talked about a couple of times in the semiconductor space.
And I believe they intentionally moved the governing body outside the U.S.
because they were afraid of the restrictions the U.S. was putting on semiconductor technology.
And I think it was smart that they did that.
and risk five is going to be wildly successful, you know, regardless of what happens to U.S. regulation.
And so I think governments need to be particularly careful that if they take a first step move here, they're going to put their own, you know, society in a worse place and their own entrepreneurism in a worse place than others around the globe.
I mean, and we've seen this play itself out with social networks and the impact they have on, you know, society.
writ large, the fact that so much power was consolidated in meta and, oh, trust us, we'll be the,
we'll protect you.
It doesn't work.
The person who's making a profit and has a profit mode of the incentive is too great to act
in the public's interest, whereas a group of people working on the project together,
they keep each other in check.
Yeah, that's like, it's a governance thing, right, Bill?
Undoubtedly, and this is, this is an important message to spread immediately.
Yeah.
So I appreciate, I appreciate you guys having me on.
I appreciate you checking in here and a great continuation of your talk from All-In.
Everybody, that's Bill Gurley.
Follow him on Twitter where he's super active.
If you're a SaaS or services company that stores customer data in the cloud, then you need to be, uh,
SOC2 compliant.
You knew that from a third party.
And you need that third party to close big deals.
And if you want to get compliant easier and faster, you need to use V-A-N-T-A.
V-A-V-A-V-A-V-A-V-A-V-T-A-A-M-T-A makes it so easy for you to
get and renew your SOC2. On average Vantan customers are SOC2 compliant in just two to four weeks.
Compare that to three to five months without Vanta. And Vanta can save you hundreds of hours of
manual work and up to 85% of compliance costs. This is a total no-brainer. And Vanta does more
than just SOC2 compliance. They also automate up to 90% compliance for GDPR, HIPAA, and more.
You can't afford to lose out our major customers. We all know that. Listen, it's a hard year.
Last year was hard. You can't lose those major customers because you,
don't have your compliance dialed in. Just work with Vanta. Get your compliance automated and tight
and tight is right. Lock down those big deals. Here's the best part. Vanta's going to give you
$1,000 off.000 off at vanta.com slash twist. That's vanta.com slash twist for a thousand dollars off
your sock, too. Okay, so let's go deeper into the actual models here. Yep. Because the demos are great.
You glossed right over the demo of what you called lava, which I think. I'm going to pull that back up.
I think we should stop.
We should start there because this is, I think, where the rubber meets the road.
If you're wondering why a friend of the pod, Sam Altman, might not want competition,
well, if you look at open source, last week we had the multi-modal chat GPT4.
We were playing with it.
I have it on my phone now.
It's extraordinary.
It's amazing.
You take a picture, you upload a picture, and you ask it to do things with the picture.
One of the things we asked it to do was to make a hamburger recipe based on a hamburger.
that Sunny was interested in.
But there is an open source project called Lava.
Yeah, large language and vision assistant.
Got it.
So this is built on top of Lama, I take it,
or it's its own project?
It's its own project.
Got it.
And it's built to mimic the spirits of multimodal GPT4.
And, you know, just kind of touching a little bit on,
you know, the microchreferral.
being yellow in that chart that we talked about earlier, you can see here it's a put together by researchers at University of Wisconsin and Madison, Microsoft, and Columbia. And so the kind of really interesting group, and I think it does justify that Microsoft is still supporting open source, which is important here. And, you know, it's a great paper. We won't spend too much time looking at it, but I suggest people to,
go look at the GitHub URL, which is here.
But, you know, this doesn't have the multi...
Just type in L-L-A-V-A,
large language and visual and vision assistant.
And so this interface looks very similar to the chat, G-P-T-4,
multimodal interface.
She said, give me instructions on how to prepare this,
this being a brioche bun, weak egg yolk hamburger that looks absolutely delicious.
And this is the same image we used last week, correct?
And so how did it do versus chat GPT4?
Yeah.
So I'll say, look, chat GPT, you know, V did a better job in terms of describing what it was and then the instructions.
But, you know, for me, what I'll say here is the fact that this is available and it's open and we can bill from it.
It's one week later.
That's that race that's starting to collapse.
Now, we're not like 100% equal a week later, but being, you know, my grade on this would probably be like, you know, in comparison to that, it's like, you know, maybe a B because it doesn't, here's here's the results right here.
Yeah, if you give them an A, you would give this student a B, which means, hey, this student applies themselves.
And if you were, let's say, a startup, would you tie your wagon to chat cheap T4, a closed system with deep ties into Microsoft?
offer. Or would you fork this and start building with your own hooks into it and wrapping?
What would you advise a startup you've invested in? What would you advise them to do, Sunny?
Or would you have them split the baby and do well? It would be like on a use case by use case,
right? Like I think it really depends on like what value you're trying to build. Like if you're
really want to, you know, have the whole stack in your control because that's how you can provide
value. I think I would do this. Like I would use, you know, lava.
but if it's just a small feature within my product,
then I would use chat GPT because I'm going to get there quicker
and I don't have to worry about the infrastructure and scaling costs.
So the reason I didn't do this one live,
it probably takes like two or three minutes to run.
Sure.
Because they don't,
and this goes back to the funding thing.
They don't have the billions of dollars behind it
to have huge farms for inference.
And so this thing runs a bit slower.
And you can try it out live yourself as well.
J-Cal, I drop the link for us here.
And, you know, I think that's,
Yeah, they don't have the 10,000 GPU cluster that Open AI has.
Exactly.
And but, you know, there's going to be cases where for your business, it's important that you build this and you build some IP around it.
And so I'll go back to a point that I made a couple of maybe episodes ago, which was the API for GPS, right?
The API for GPS on a cell phone been around for a long time.
Apple, you know, made it slightly better.
But companies were built when they built a lot of infrastructure.
around that, right? And saying, hey, like, you know, I'm going to build my app there. And I'm not,
this is just one part of what I'm trying to do. So I think you have to kind of look at in some
places, you know, how important is this to the core of what you're trying to build. But I, I do
want to reiterate, like, look how fast, how quickly this is happening on the backs of it. And I do think,
it's why it was great to have Bill here earlier. This is why there's so much noise around
slowing this down, because if you're this company, you know, being valued at, you know,
whatever, 20 billion, 50 billion, 100 billion,
these ridiculous valuations right now,
to have an open source model right on your heels,
it's got to be really challenging.
I'll put that back to you as an investor.
Anybody who wants to buy shares in OpenAI at 90 billion
for common shares, I take it, with a capped upside,
you would probably want to monitor these open source solutions
and say, well, if they're getting 90 times revenue,
or 100 times revenue, whatever it is,
for the current valuation of OpenAI,
is that a good bet?
Or are these open source projects
going to create downward pressure?
And the downward pressure would provide
is the API calls
and what chat chip TEPT for OpenAI
could charge for access to their language model
is going to go down
because you could just fire up your own open source
solution on your own servers.
And this is where Microsoft,
Google, and of course,
We didn't talk about Amazon's position here, which is, I think they also invested in Anthropic.
Anthropic, yeah.
Amazon's been very clear that they would like to be a neutral third party in all of this.
And so, Amazon will make money no matter what.
They're going to have every language model on AWS, and their interest is in continuing to lower the cost, continually, and just make up for it in scale.
But this could be, you know, a road to nowhere.
This could make Open AI be like SeaGays.
you know, it's like a hard drive provider.
Like, there's just not a lot of value
capture there.
You know, in the cloud today, you know,
for the most part, obviously, you know,
some people do do this specifically,
but when you're asking for a server,
you don't really care whether it's,
you know, Intel or AMD or, you know,
some kind of a thing that's virtualized
running, you know, arm underneath it, right?
And so, especially, you know,
now if you have specific use cases that require
that type of, you know, like a certain instruction set that only Intel provides for performance,
you'll ask for it. But I think, and those clouds have all those options available.
All right, everybody. Stephen Estes is a principal at CLA. Clifton Larson Allen. This is a professional
service provider that specializes in CPA, tax consulting and wealth advisory. Welcome to the
program, Stephen. Thank you for having me. So at what point should a startup seek out professional
accounting and tax services? I think there's a couple tipping points, right? One of them is
really as soon as funding and equity-based comp come into play. So whether it's a 500 case,
you know, seed or pre-seed or a $10 million series A, I mean, as soon as you're raising and you're
looking to hire talent and give equity to those people, you should have quality advisors for both
legal and tax. Like I said, another market really is that foreign activity, right? Having a
foreign subsidiary or foreign founders, you can really find yourselves in hot water real quick if you
don't know what you're doing in that regard because all companies have to play by the same rules in the
international sandbox, whether it's a startup or Coca-Cola.
Get started right now at C-L-A-Connect.com slash tech.
Let them know your boy, Jake Howl sent you.
C-L-A-Connect.com slash tech to get started right now.
Let's go back to Mishrel for a second.
I thought this was fascinating as well.
Yeah.
I just ran this live while we were sitting here.
So the folks that were watching, you could see that's the time for it.
Yeah.
But.
Okay.
So suffice to say, you run it on your own servers.
You can go faster.
And it's going to just get fast.
faster every week.
And then, you know, these open source projects can sometimes have such a diversity
of talent in them.
And because they don't have a command and control structure, they're going to have more
interesting insights, right?
That's the nature of open source projects.
You might have these four or five, you know, developers in South America and then these 12
in, you know, Japan, and then these six in Ukraine.
And they all have some other use case and they contribute to the model in a way that's different
than if Microsoft or Google
or Open AI or building software.
They don't need to ask permission
to work on some part of the model
or the open source project, correct?
Yeah, I mean, not at all.
It's permissionless.
Yeah, yeah.
Now, there might be some permissions that occur
when you do want to actually commit those changes.
Exactly, yeah.
So to go, now, usually how these projects are governed
is, you know, they, you know, like Bill was mentioning, they have organizations that are built to,
you know, kind of decide, like, you know, where the direction of the project is going.
And so those then directions then basically the developers on the open source project use those
high level. Like, think about that as roadmap and vision. Say, oh, we want to build this next
and everyone aims towards it. But nothing stops you from taking it if you don't agree with that
and doing something and then, you know, submitting it for commitment into the project. And
either they can take that or you can fork it.
Many projects have been forked, right?
And so that's happened over the years as well.
If the core project doesn't want to go off on your side quest,
you can just make a copy of any, you fork it.
And you then start a new project.
And we've seen that happen over and over and over again.
Okay, let's get back to Mistrel.
M-I-S-T-R-A-L.
Yeah, correct.
Okay, now you mentioned how many parameters.
Let's explain that for civilians listening.
Yeah.
Yeah.
So the way I kind of put this into simple thinking is more parameters equals, like,
parameters are like neurons in a brain.
And so if you look at a small organism, it doesn't, you know, has a very small brain,
doesn't have a lot of neurons.
I think it's generally considered that humans have somewhere between 40 and 80 billion neurons.
And so when we think about it,
the more neurons exist, the more data that this has access to that not from like, well,
it started from what it was trained on, but that data is then processed and held in kind of
these weird fragments that we've talked about before.
And so the more of that that's there, the more it's able to basically reason, thank, provide
you really, you know, incredible results like we've seen.
And so these folks at Mistral have done an incredible job of creating a smaller model that
performs like the bit larger ones.
That's, you know,
probably due to some of the technological,
um,
approaches they've taken and also perhaps even the training data.
And,
um,
and they've shown that in,
uh,
like knowledge and reasoning and,
uh,
across these different tasks,
uh,
which,
which,
you know,
math code,
uh,
which is really incredible.
I thought this was very exciting.
So the exciting thing here is,
you know,
people are taking different approaches.
They're making,
um,
you know,
these language.
models more efficient and faster, smaller, better, cheaper, all of those things.
And how many language models is the open source community grinding on right now?
And when I say grinding, like making daily progress on, like, you know, of the, you know,
there could be some projects that are abandoned, et cetera.
So like listing all the language model projects on GitHub or wherever hugging phase,
that's not productive.
but let's say major ones that are getting daily updates to them, right?
Because daily updates would be a sign that this thing is cooking.
So how many are they to do?
There's probably under 10 that are like, and so what we really have to do is maybe take a step back
because the space is playing out in a really unique way.
The folks that are, there's a less set of people that are going after really, really large models like Open AI that are general and
intelligence models, right?
Where more of the energy is going,
a smaller run in,
you know,
kind of more confined,
uh,
compute requirements.
And that area has a lot of models,
like too many to count.
And what many folks have realized is that it's a better approach to go after a smaller
model,
um,
that's,
you know,
tuned or trained for specific tasks than to try to compete with,
uh,
general purpose models.
Yeah.
And so,
that means,
that when we talk about this competition
and regulatory capture,
if you're open AI,
if you're anthropic,
and you want to lock all this down,
if you're Reed Hoffman or Musafah,
or Sam Altman,
you know,
Greg, whatever,
you're saying,
hey, slow it down.
Trust us.
We're going to protect everybody.
You then have,
they're under attack that,
hey,
I'm going to make a verticalized one
that's just for audio,
just for video,
just for,
code, just for, you know, a specific language or a specific culture, whatever the vertical is,
you're going to see, you know, these large language models be attacked by a thousand cuts,
by verticals, and by thousands of people contributing to an open source model.
Is the goal to do smaller, more nimble models, like, and have the best model with the least
parameters. Is that like an attack vector here that people are trying to make these things smaller
and more efficient and cheaper to run? Yeah. I think that's the ultimate goal because what we've
already seen in the last year and, you know, with the rise of Nvidia stock was these really,
really large models require, you know, compute and energy, like less people talk about the
energy, but compute and energy, when you have like a cluster of 10,000 or 100,000 GPUs,
the amount of energy it's using is really substantial.
I wonder what one of those costs to run a day, like one of those GPUs, you know, an H-100 per day if it's actually doing work, it's doing jobs.
So maybe the guys can look at this one.
But honestly, I think it was like the amount of electricity more than like a house.
It is like something really substantial that we can probably research in the background here.
Well, and then a house might cost $1,000 a month or something.
Yeah.
So if these things cost $1,000 a month to run.
Yeah.
Yeah.
And you've got $1,000.
of them. That's a million dollars a month. It's 10 million a month in energy costs for 10,000
of them, which I think is what some of these big clusters are now doing.
Yeah. That's not nothing. You know, it's like a hundred and twenty million dollars a
year and electricity cost on top of these things costing 40K each or whatever they cost.
Yeah. Yeah. Yeah. The energy cost might be 25% of the yearly cost. So over four years,
it might be the same. Yeah. It's crazy. Yeah. So I'm just like, I'm looking at it here while we're
doing it, but it basically is saying like,
an H-100 card runs at 700 watts, right?
And so think about that 24-7,
that turns into a really, really big night.
That basically turns into,
I would say my guess,
is close to a megawatt a month.
Yeah, so we have to then figure out
what a megawatt a month cost.
And here's from Reuters.
Running chat GPT is very expensive for the company.
Each query costs roughly four cents,
according to an analysis from Bernstein.
if chat GPT queries grow to a tenth the scale of Google search require roughly 48 billion
worth of GPUs initially and about 16 billion worth of chips a year to keep operational,
which is just crazy.
But I think this is going to plummet, right?
It's going to go down 50%, 90% a year.
So, and yeah, I get the sense this is going to be, you know, if it's 4 cents a query,
it's going to be 0.4 cents and then 0.04 cents.
And then nobody's going to even think about it.
Kind of like storage became so de minimis.
But the energy cost is pretty crazy.
There's been reports that everybody's making chips.
So OpenAI supposedly is looking into making chips.
We heard last week that part of, I think, the anthropic deal was that Amazon was making
their own chips and they wanted anthropic to use it.
Obviously, Apple makes the M1, the M2, whatever's at the A, whatever they're up to, A14, 15, 16,
whatever's in the iPhone.
So we now have everybody's going to make their own chips.
I heard Google is going to make their own chips too.
So now we have...
Google's been doing it for a long time, right?
They have the TPU, right?
Which is, yeah, they've been doing that for a very long time.
There seems to be some intense ramp up of this
because not only can't you get Nvidia chips,
there's a line out the door for them,
people maybe don't feel great about the pricing of them.
So now everybody makes their own chips.
So that is something I didn't anticipate.
And so here's the headline from Friday, Microsoft to debut AI chip next month.
That could cut Nvidia GPU costs.
And that's from the information.
So I started my career in actually making chips, but for networking equipment.
And this happened in networking at the late 90s.
everyone needed their own chips to basically interface with the immense growth of the internet
on the sort of the core side of the internet with optical, you know, with fiber optics, right?
And so everyone was making chips there.
And there was a time in place, you know, somewhere between, say, 97 and 2001 where there
would have been like hundreds of chip companies making front-end chips to interface with,
you know, optical transceivers.
That ultimately all consolidated down to maybe like three.
companies. And it happened for the same reason. No one could get these chips, right? And then no one
could develop them in ways that each particular vendor required. And I think we're just kind of
seeing that movie again here. And we'll see an explosion, which will bring the cost down. And then
ultimately, I think it'll consolidate as well. But those are the waves that we see in tech anyways,
right? Where we see like expansion and consolidation and expansion again. And so a massive amount
of investment and then it becomes commoditized or so cheap that
people maybe some number of players
bow out and let...
Or roll up and...
That's another way.
I mean, we saw that.
I predicted this in GPUs
because we saw it with fiber
people were building out so much fiber
that they overbuilt.
And then I think Google and some other providers
bought up a lot of that fiber
you know, pennies on the dollar
that have been overbuilt.
So we might be in the overbuilding,
overbell phase.
What other demos do we have for this week?
Yeah, so I just had like
mistral, which, you know, we can run through that one.
Actually, like, we just, I had the paper there, so I can pull that one up.
And so this is their playground on hugging face as well, where we spend some time.
All right.
Let's, uh, give me a, give me a drop in here, J-Cal, and I'll send you this link to if you want to try one.
Give me a question here.
Oh, question?
Well, you know, how about what are the best restaurants?
Ooh, okay.
in Napa.
That's it.
I mean,
this is like some live information
who knows where they got this information.
But it's screaming fast,
even on hugging face.
I mean, instant answer.
So that was,
that's the first thing I'm noticing
is that this is.
And,
yeah,
it,
of course,
got it exactly right.
Bouchon,
French laundry,
Sobar,
I mean,
ad hoc.
Yeah,
these are all great ones.
Botega, I know a bunch of these.
And if you asked it,
what,
add,
put those in a table and add the average cost of dinner.
I wonder if it has anything there.
I don't think so.
You know,
that would be like more live data, but it did pretty good here,
you know, depending on what.
So, yeah, I don't think it'll have that,
but like, yeah.
So, yeah.
Yeah.
But, you know, that's a tough one.
I think even for,
GPD, but what I really liked about this one, and I'm glad you did this question, was like,
it, and this is what we're going to get as we put these out there. It first of all kind of says,
hey, in order to provide the best, but here are my assumptions, right? And it talks about,
hey, you can define best. And here's like a combination of food quality, good service and that.
And then target audience and research how it got there. I really, really am a fan of like,
like you said, the speed, you could run this on your laptop if you want to, right? And I think
That's really incredible here.
Yeah.
So you could give this language model to an astronaut, you know, going to Mars.
And if it lost contact with the Internet, it would still be able to give those answers,
which is just for people to think about this.
If you were stranded on a desert island, having one of these language models fully trained on a laptop,
you would be able to have like an increasingly impressive,
conversation that could wind up saving your life.
If you asked it how to start a fire on a desert island, what would it say?
Yeah.
I mean, actually ask it.
You know, I'm, I'm trapped.
If you were trapped on a desert island, how would you start a fire?
Now, you know, if you didn't go to survival school and you had this, all of a sudden,
you've got this bot with you on some foreign location.
I wonder if it will actually even help you build that fire.
You could, you find a piece of glass is one technique rubbing two sticks together.
There's another obvious technique trying to find flint or stones or metal that could spark something.
Gather material.
Collect dry leaves, small branches, of course, that's kindling.
Find a suitable location, okay.
Create a fire pit.
Prepare the Tinder, okay, this is all the same stuff.
Kindling, prepare fuel.
Create a fire layer structure, okay?
Ignite the fire.
There are several methods to start a fire, such as using flint and steel magnifying glass or a battery.
I forgot the battery one.
So that's interesting.
So tell it to explain to me the techniques in number 10.
Let's see if it understands that we're asking it about its existing answer.
Because steps 1 through 9 are preparing a fire, putting the wood together, but not actually sparking the fire.
What we care about is actually starting it.
I wonder if it will be able to teach us how to start a fire.
Oh, yeah, here we go, Flint and Steel.
This method involves striking Flint against Steel Rod, okay, magnifying glass.
on a sunny day focus,
sunlight onto the Tinder using a magnifying glass battery
and steel wool, that works, yeah.
A lighter of matches, yeah, if you have them.
It's pretty good, pretty good answer.
And what else do we have in the demos?
In the demos.
Everybody loves the demos here.
Yep, so last week, we kind of,
right at the end, we got into WhatsApp
and the model and the AI.
And I think one of the things I wanted to correct, so I wanted to pull WhatsApp back up because, you know, it is available in the group chat.
So I was getting a second here.
Yeah.
So you can do a group chat and include an agent.
So because that was one of the reasons you didn't give it even a higher grade last week.
Yeah, and I loved it last week.
I was super impressed.
And I just thought, you know, if my wife and I had a chat going and we had slash Gordon Ramsey and we could, you know,
Ask Gordon, hey, here's what we have.
What should we make?
Hey, we got some salmon and we've got some pasta.
You know, we've got butter and cheese.
And it was like, okay, yeah, you make some farfali with salmon in it.
Here's a recipe for you.
So this is our AI, you know, twist WhatsApp group that I created.
Let me get this over the side.
And in here we have the meta agent, right?
And so, and if you can see here, you can do sort of an at, and it allows you to add meta-a-i
and say, I am thinking about booking a trip to Napa, what are the top five hotels?
Okay, here we go.
Let's see.
And what's great about this is, like, I could then ask a follow-up to it, right?
To respond back in a second, maybe they've gone slow as well now.
There we go.
Well, there we go.
Yeah.
Arbosge.
Meadowwood, four seasons.
Yeah, I mean, it's not great.
It's not terrible.
Yeah, not great, not bad.
What's a fun activity for the three of us to do while in,
Lake Tahoe over Christmas.
I mean, it's pretty obvious what you do in Lake Tahoe over Christmas, right?
Consider ice skating, taking a sleigh ride, enjoying a festive atmosphere.
Yeah, we could have also gone skiing, but we're snowmobiling.
Not a great answer.
Not a lot of snow at Christmas sometimes, though, but maybe it knows.
Yeah.
How about more answers?
Additionally, you can explore Emerald Waters of Clear Coyac tort, not in the, not
in the winter.
You could do that.
So you can see the meta-a-i is like a very rudimentary AI, I think.
I'm not sure who.
I'm not sure which model that is.
That's Lama.
Must be, right?
Is there a language model?
Yeah, it's a law month, right?
Yeah, it's definitely that one.
Yeah.
Yeah, I think that leaves a lot to be desired right now.
I think this is where I think verticalized AI is going to be much better.
when you ask about travel,
you really need a travel AI
that's just that, right?
So I think that that's where these models
are going to wind up is that they'll be very
verticalized and fine-tuned ones
by vertical.
And if you ask about food,
it should really be narrowing you down
and not just giving you the generic one.
It should be,
just like when you search now
for recipes on Google,
it doesn't just give you 10 blue links.
It really is thoughtful,
and they have a lot more information on it.
Listen, there's been another amazing
episode. Thanks for Bill Gurley for tuning in. It's pretty clear.
The community must fight for more open source. Anytime anybody says, trust us. We will be
the sole source. You should be wondering if you should trust them. And just, listen, you
need to fight for open source and you don't want one private company. It's not a dick to any,
you know, it's not a dick to Sam. It's not a dick to Anthropic. You don't want any one party controlling a platform
technology, you want it to be open source, you want closed source solutions, private
company solutions, open source solutions, you want a range of opportunities. And remember,
the last time we had some founders say, trust me, I'll get us some regulation. That was SBF.
And he was going to be the one who got us federal regulation for crypto. And he's on trial.
That's a very moment for a bunch of Mushugina. I mean, are you following that? Sonny, you were down
the crypto rabbit hole.
Yeah, and, you know, very deeply and following it, and it's really sad to see, you know,
I think one of the, you know, I think one of the more, there's a lot of shocking things,
so not to say it, but one of the more shocking things I saw last week is they were running
some kind of insurance service.
And there was like a counter that was saying, oh, like this much money is insured by this.
And that counter was basically like a random number generator that was like picking a number
between like, you know, $7,500 and $10,000.
And like, I mean, talk about explicit, like, fraud.
There you go, exactly.
I mean, it's deranged.
Your point is if, yeah, to say that like, you know,
this was a real business and, you know,
that they were just in over their heads
when they were doing free meditated things like this.
And this is where like all the chats eventually get done
all the emails, all the slacks, everybody winds up flipping.
No matter what the case is, it's very rare that a group of people will circle the wagons.
Like, even the mob, you know, and the mafia, you know, had a hard time maintaining that.
And this is like their entire lives, their families, their traditions were around this, right?
Yeah.
Maybe the cartels on the margins can, but not a bunch of dopey kids who were working together for 18 months.
They were committing fraud after fraud.
in this case,
they were using a random number generator,
you know,
with some parameters on it to dupe the public
when they were getting insurance.
Yeah.
If I'm understanding it correctly.
Um,
and listen.
And it takes,
like it takes,
you know,
like a certain amount of evil to do that,
right?
Because you're,
you know,
there's lots of cases people have gone sideways,
like,
you know,
Theranos,
Elizabeth Holmes.
And,
you know,
at the core,
I think she was trying to make,
you know,
like a testing machine.
It's just,
uh,
kind of,
That's the case of it maybe got ahead of you, right?
You got ahead of your skis.
You were a bullshit artist.
You thought, ah, you know, I can fake it till I make it.
This wasn't fake it till you make it.
This was, let's orchestrate a huge crime.
And, you know, listen, if you're in the mob and you flip, you get whacked.
With these kids, like, if they flip, there's nobody there to whack them.
It's not like SBF's parents or like some criminal mastermind cartel.
They're a bunch of dopey Stanford professors who, you know, didn't
raise their kid correctly and, you know, they're not going to whack the other members.
They're all going to flip on each other. It's all going to come out. I told everybody, I will bet
dollars to donuts that he gets over 30 years. I'm saying over 30 years.
One thing in defense. We take the over the under a 30 year sentence. I would take the over.
Yeah. If I said 50, we should take the over and now. I think under. I think so 40 would be the line.
Yeah. We have to think it through. What would you take up 40?
Uh huh?
Pardon?
We take the over to the under at 40 years.
I think the under.
I think it'll come right around that 30 years.
Okay, there we go.
Wow, look at that.
I said a pretty good line.
So the line is probably 36 years.
You know,
he gets out before retirement kind of thing, you know,
and that.
I think they're going to Bernie Madoff got multiple life sentences.
So I set the 30 year,
but I actually think there's a chance he gets life.
I think this could be a life kind of situation.
Well, you know, it's very rare that you catch somebody red-handed.
Yeah.
doing a multi-billion-dollar crime.
There's a small number of multi-billion-dollar crimes in the history of humanity.
Yeah.
It's hard to pull off.
And so if you're going to put, you know, some, you know, kids in a local city who were dealing drugs 20 years, 30 years, and they were dealing, you know, a half million dollars in drugs, a million dollars in drugs.
Let's just call it a million dollars in drugs.
A million dollar drug, you know, cartel, you know, in Chicago.
And this kid was stealing over a billion.
I mean, it doesn't feel proportional, right?
If he, if he were to get anything but a magnitude more because it's a thousand X that crime.
Yeah.
This is serious crime, folks.
Yeah, I don't know.
Yeah, I mean, you're spot on there.
I just don't know how to, like, do that relative calculation.
I mean, like, there's people in jail today for marijuana use that, like, and, you know, it's, it's allowed now.
Right.
Yeah, no, luckily we're, we're, there is, uh, consensus.
Trump, Biden, Obama before everybody is, you know, a line that we need to reverse those for nonviolent felons.
Um, and really think about those.
All right, listen, another great episode.
And, uh, you know, listen, um, he's guilty.
I've made my decision already.
It's the evidence is there.
The jury will make their decision, but, uh, I hope he gets a, a huge sentence.
Uh, and that's it.
All right.
Cindy Madra, they have it, folks, Definitive Intelligence.
If your company is looking for AI and analysis of big data and you need help, Sunny's your guy.
Just email Sunny at definitive.io.
And we'll see you all next time. Bye-bye.
