This Week in Startups - Republic opens private markets as Positron takes on GPUs: A TWiST500 doubleheader! | E2176

Starting point is 00:00:00 It just strikes me that we're going to have a future that's going to be a little bit more efficient than people think. We're not going to boil all the oceans. We're still going to have power for our appliances. And we're going to have essentially infinite intelligence in our pockets and all of our screens. And to me, that's a pretty exciting future. And we better. We will need to have infinite intelligence. And that's the whole goal.

Starting point is 00:00:19 Like, if you ask in media, if you ask us, if you ask any other Silicon Company, I think they're all trying to say that, you know, we're approaching it with some different approaches. but basically it's like how can we bring cost per token or cost per generation down so that we can afford to have infinite intelligence while increasing our energy production as much as we can. This weekend startups is brought to you by Alpha Sense. Get deeper insights into your business with the power of AI search and market intelligence. Start with a free trial at Alpha-sense.com slash twist. Vanta. Clients and security shouldn't be a deal breaker for startups to win new business. Vanta makes it easy for companies to get a SOC tube report fast. Get $1,000 off for a limited time at vanta.com slash twist. And Oracle.

Starting point is 00:01:13 Oracle Cloud Infrastructure or OCI is a single platform for your infrastructure, database, application development, and AI needs. Save up to 50% on your cloud bill at Oracle.com slash twist. Hey everybody, welcome back to this week in startups. My name is Alex, and today I am bringing you two startup interviews. Now, both of these companies share a backer, Valor Equity Partners. So if you want to know what one venture capital firm sees in the future, well, here is a good look at it. First up, we're going to talk to Republic. You may know them back in the day as an equity crowdfunding platform, but since then, it's gotten into quite a lot more activities, a little bit of venture capital,

Starting point is 00:01:52 some secondary shares, tokenization of assets, you name it, they're working. They're working. working on it. I care about this company because it's democratizing finance and bringing more investment opportunities to more people. So that way when my kids are older, well, they're going to have a lot more different things to put their capital into than I did when I was their age. Then we're going to talk to Positron, another company that wants to build chips for faster and more energy efficient AI inference compute, given that around the world we're seeing data centers run into power availability problems. What they are working on could have enormous application, enormous revenues, and also may just challenge NVIDIA a little bit around the edges

Starting point is 00:02:27 in the next couple of years. These chats were a lot of fun. I learned a lot. I hope you love them. Let's get into it. Here's Republic. To set the stage now, making normally out-of-reach assets and investing methods more accessible to everyday investors is big business. Robin Hood grew on the back of zero-cost trading and bringing access to more exotic financial trading tools to regular folks, to pick an example. Now, Republic, in contrast, has long been known for its place. in the crowdfunding market. Spunned out of Angelus, the company has raised more than $200 million, including a well-known $150 million round in late 2021, led by Valor Equity Partners. Now, in the intervening years, Republic has expanded its feature set far beyond traditional

Starting point is 00:03:09 crowdfunding. So to tell us more about the state of alternative investing and where Republic sees our democratized financial world heading, please welcome to the show. It's Kendrick Wynn, co-CEO of Republic. Kendrick, how you doing? Alex, thank you so much for having me. me, I'm doing fantastic. I also love that we have in your background, and I say this with nothing but love, the de facto Silicon Valley office.

Starting point is 00:03:31 It is standing desks, multiple monitors, and people typing. And that is exactly my happy place. I love to see it. How many folks do you have behind you? You know, maybe about 40 or so, and we're in Silicon Alley. So New York has become quite the tech hub as well, and we're super proud to be here.

Starting point is 00:03:47 But our heritage and our route go all the way back to Silicon Valley a decade ago. So this is off topic, but I'm just curious while I have you here, and we're talking about the geographic split, because I'm on the East Coast too. As the co-CEO of a leading technology startup, do you ever feel that magnetic drag back to San Francisco? Because the way I hear it told, in the AI era,

Starting point is 00:04:08 SF has once again regained its primacy as kind of, if you will, the New York City of Tech. You know when we launched Republic back in 2016, Naval Ravica and my... mentor and boss at the time, was like, hey, everyone is, you know, based in all come to Silicon Valley, why are you looking to move to New York? And my view is twofold. One is that we already have Angelis and San Francisco as the roots, right? That's how I used to know a lot of the VCs for my days at Angel is. And secondly, if you're building FinTech, then I think it's a good thing to be based

Starting point is 00:04:44 out of the financial capital of the world. And that's obviously New York cities. So in the particular industry that we're in, in the business that we're in, I think that us being headquartered in New York with a lot of trips and events and network back into the valley is a very ideal way, an ideal setup for us. Yeah, I don't even want to know how many frequent flyer miles you've wrapped up flying to SFO over the years. Okay, but let's get down to brass tax here. I know Republic. I've known about the company since it was founded. I've always thought about it as a crowdfunding platform. Clearly, you guys do a lot more. And we'll talk about that in a second. But just to get people caught up, what is the state of crowdfunding today? It was pitched as a way to really break open a lot of private companies for folks. So has it lived up to that potential? And is it still growing.

Starting point is 00:05:39 Alex, if I may just define crowdfunding first and foremost. It's that is so broad. Kickstarter is crowdfunding. It's Indigo is crowdfunding. Angelis is crowdfunding. You have accredited investor coming in to co-invest in a deal. A republic only took advantage of a law that changed in 2016 that allowed for anyone. No matter what net worth, what income can come in and back and invest and have equity upside in a wide variety of businesses. So fast forward nearly 10 years now, there's no doubt that the business model has proven to work,

Starting point is 00:06:17 not just for early stage tech companies, but late stage, movie financing, music financing, crypto, that is digital securities being fractionalized. So I think that we're still in a long first inning this next era, but there's no question that is well on the way. But I think the true potential, the size of it, is something that is to be seen in the months and years ahead. I appreciate the clarification. I probably should have said equity crowdfunding versus crowdfunding writ large.

Starting point is 00:06:53 I mean, one of my favorite bands, Arch Spire, is crowdfunding their next record. That's not the same thing as what we're discussing here. Okay, so fair enough. But there was a change in late 2020 that raised the cap that a company could raise in equity crowdfunding dollar amount from, I think it was Kendrick 1.1 million to five. And so I'm curious now four or five years past that point, what impacted that have on an equity crowdfunding and also on Republic's business? You know, it brings later stage companies because $1 million, it's still a lot of money,

Starting point is 00:07:23 but, you know, for a company that's already in this series A, series B, series C, $5 million as a cap is a little bit more meaningful. They can engage in bringing more customers, more community members. And, you know, I'm optimistic that cap will continue to go up, you know, to perhaps 20 millions. And even there's been talk in in D.C. of potentially expanding it to $100 million and beyond. So again, just the very beginning, but there's no question that currently, as is,

Starting point is 00:07:56 we're working with large enterprises across a wide range of sectors and industries who want to engage the public through this regulatory framework in the U.S. So generally speaking, I'm in favor of the cap going up. I'm curious, though, is there a ceiling that you would put on it? Is there a dollar amount that's probably too big? Or is there no real limit here in your view of how high we should raise that, that max limit?

Starting point is 00:08:22 You know, I am definitely a firm believer in free market to do so sensibly. But I think these arbitrary cap on five millions or ten millions, I think it's a little bit artificial. I think there are other ways to address to protect investors' interests rather than this arbitrary threshold. So if it were up to me, I would not put a threshold on it, similar to, you know, under Regulation D or if you're going to go public, you know, for example. Yeah. No, I'm with you on that. I was curious if there was any technical or hidden risk that I wasn't thinking of, but I'm glad that we're aligned there. Now, equity crowd funding, one part of the business, clearly not the entire thing. You also have Republic Capital, which I believe has something like 500, 600 million AUM. Is that a traditional venture capital vehicle? that I should kind of think of as in competition with a Sequoia, or does it have a different posture on the market that I should keep in mind?

Starting point is 00:09:21 Alex, it's a great question. And allow me to define what we are today. And I think it ties everything in together. The business of our public is building the infrastructure that can accommodate capital raising and community investing next generation. What does it mean? It means that the regulatory framework to take in non-accredited, retail, the ability to accommodate institutional as well to SBV or direct, their ability to do

Starting point is 00:09:50 secondary trading, their ability to tokenize and put a, whether it's a small business or a, you know, a certain crypto project on a revenue sharing basis on chain, right? So when we first launch, the first piece was equity crowdfunding. We added on capital and of course to generate revenue, we did syndication and other things, but think of it as different components that when all in together and able an enterprise to fractionalize, tokenize, and engage with the retail public globally with liquidity, that's obviously a very ambitious plan. And it took us 10 years to get to where we are today, which is the first year that we have a completely functional infrastructure for RWA real-war asset tokenization for true retail participation.

Starting point is 00:10:46 We're the only one as far as I know in the market that have that complete regulatory framework. Now, we do have, you know, we add on business line, business model, but no, we're not looking to compete with Sequoia. We do SBV. We do syndication. I'm wearing a shirt from one of our portfolio company, called K2. But the business, Sequoia's venture capital, the business of Republic, is in finance. infrastructure and asset tokenization.

Starting point is 00:11:13 I don't want to ruin your day, but have you taken a look at your cloud computing bill lately? They probably give you some kind of deal to start, but over time, those bills start to add up. Well, our friends over at Oracle Cloud Infrastructure want to help you cut your cloud bill in half. Yes, that's right, 50% while you're getting better performance at the same time. OCI is a next generation cloud designed to work with any application, including AI. It's faster. It's more secure and you can do it for less. We're talking complete cloud infrastructure and services regardless of your specific setup or workload. And OCI costs significantly less than other clouds with a span of 50 interconnected cloud regions and more than 150 OCI services apiece so you can access your cloud from anywhere and keep your prices consistently low worldwide.

Starting point is 00:12:06 So join modal, Skydance animation and more innovative AI tech companies who upgraded to OECD. and saved. See if you qualify for half off at oracle.com slash twist. That's oracle.com slash twist. This offer is only for new U.S. customers with a minimum commitment. Okay. It's almost like you read by notes because that's where I was going to take us after a couple of questions. So we're going to loop back to the tokenization and the platform element of this. I just wanted to break down the different things you're doing today and then we'll talk about how they come together. So Republic Capital then is a tool to collect retail and probably family office money and put it into SPVs.

Starting point is 00:12:47 Is that the main work that it does? Correct. That you aggregate larger checkside from family offices and institutions into SPVs and deploy them into more mature companies that's typically private equity of venture capital, and that is the public capital. And then you have a separate product called Republic Venture that is only open to accredited investors. So not the, not the masses, as it were, but folks that have made a certain test or threshold set by the government, and you and I have probably similar views about that.

Starting point is 00:13:19 But I'm curious, can Republic Venture customers invest in the Republic Capital product? Yes, I mean, Republic Venture is just earlier stage. Okay. And the Republic Capital aims at larger check in later stage, you know, companies. Again, so you can see within Republic that we have businesses that almost, replicate miniature version of the entire ecosystem, right? In the broader market, you have venture firms, you have PE firms, you have platforms. We have all of these things just so that we then can replicate and change and bring forth the entire financial ecosystem.

Starting point is 00:13:59 And another element that you're bringing forward is secondary trading. You guys purchased Cedars, which is now Republic Europe, and you've also moved into secondary trading. We talk a lot about secondary shares here on Twist because, you know, there was the venture liquidity crisis. Everyone was desperate to get a little DPI. I'm curious, Kendrick, how big of a business is the secondary trading part of Republic today? Is it large? Is it small? I'm just curious relative to the rest of what you do. Yes. So today's still relatively small. That is, you mentioned Cedars, which is now Republic Europe. That's our UK European side. It does both primary and secondary trading. For example, if you go in Republic Europe, you're going to see Revolut.

Starting point is 00:14:40 one of the largest private, you know, neobank out of Europe. You're going to see shares of Revolut being actively traded among non-accredited investor on Republic Europe. In the U.S., the regulatory framework is a lot more complicated. That's why Carter failed at it. That's why Forge an equity day and equity then only deal with, you know, high net worth investors. That's why Angelus, there's some secondaries.

Starting point is 00:15:09 I mean, I think we did the first one when. I was still there and structured it. But that's only for high net worth individuals. No one has managed to do secondary trading for non-accredited investor in the United States, a company that we are in the process of acquiring called IANX. And INAX is a very unique set of licenses as an ATS that has the ability to do so. So we're very excited to roll out at scale true secondary trading of private securities for non-accredited investor sometime over the next six months or so. Kendrick, when I hear the acronym ATS, I think applicant tracking system, I presume that's not what it means.

Starting point is 00:15:51 So can you define that acronym for us? It's automated trading system. Ah, that makes way more sense. Notice, you know, in exchange, I think this is just a legal thing. For most people, there's no difference. But regulatory-wise, you call yourself an exchange. You take on certain regulatory obligations. if you call yourself an ATS, which is like an exchange light,

Starting point is 00:16:12 then you're under a different regulatory framework. So one, I don't mean to be a spoil sport, but one thing that has always made me a little bit leery of secondary investing is a lack of information. And there seems to be a pretty big disparity at times between what primary investors get in a venture round and what secondary investors may get later on if they purchase their shares on equity for which are on Republic, et cetera.

Starting point is 00:16:36 equity zen, I guess, or forged global. How do you fix that? How do you make sure there's enough information coming out from these companies to allow for a retail investor to take a position in Revolut, for example, without just gambling? Yeah. Alex, it's a great question. First of all, the same information disparity or the lack of or the insufficiency of applies to venture capital to the traditional private and public market. I'll give you an example. Sure. Investment on AngelList, right?

Starting point is 00:17:11 So if you have a doctor coming to AngelList to look at a YC company, information available literally fits on one page. It's not really a ton more information there. And of course, we trust that that doctor can make the investment sensibly. So about 10 years ago, the SEC in Congress already made a decision that, hey, just because you're not a millionaire, it doesn't mean that you shouldn't. be able to participate in the private market. In the same way that, you know, the information on Amazon when you buy a product may be different

Starting point is 00:17:48 than if you were to buy a product directly on the company's website, but you've got to trust in the market and in the public to make sensible decision for themselves. Now, the law does add on a few things. You know, ATSs and exchanges and broker dealers have the obligation to do their own due diligence and make sure that they introduce things that are suitable for their customers. But I don't think that the public disclosure requirement for the public market to your IPO is some sort of a goal standard that lasts forever. I don't know about you, but I, even though I'm relatively informed and do have a, you know,

Starting point is 00:18:27 meaningful public equity portfolio, I can't remember the last time I read a. 10K or an AK or an SEC filing before making an investment in Tesla or Facebook. So at the end- Sure. But Kendrick, you're taking on, so, one, I appreciate the clarity here. This is very useful. But it sounds like you as a company take on a lot of responsibility then, to not bring trash, essentially, to your customer.

Starting point is 00:18:55 So the due diligence is in some ways mediated by Republic. In a way. So our goal is to make sure that we have the legitimacy lens. And of course, we also apply an additional lens on suitability. But we don't aim, nor is it feasible to say that these are high-quality deals. And if you invest in hand and you're going to make money online, that's not how the industry works. I do think that market maturation in the face ahead is through education, onboarding the investor base, and then present legitimate, that is non-fructural and ideally companies that don't make misrepresentations and leave it to the general public to make that decision that if they have $1,000 to invest, they should invest, you know,

Starting point is 00:19:46 $15 or $20 in 50 different companies and not putting $1,000 in one. So it's about education and access, not about some curative lens of any one particular, firm or fund? Right. You make sure it's a suitable investment and not a clear fraud, but after that, it's up to the people to make their own choices. That's why that makes sense to me. Work faster isn't just my personal motto. It's the key to success for busy founders everywhere. Startups have always had to move quickly, but even with that pace, that doesn't mean you can ignore security and regulatory requirements. But fear not, there's Vanta. Their AI power platform

Starting point is 00:20:31 makes it easy to ensure compliance in days and keeps you on top of everything. So nothing holds you up today or down the road. Keep moving at startup speed with confidence because Vanta scales with you. With round the clock support in case you run into any problems, they're going to be there for you. Vant understands your company's needs because they're a fast moving startup too. In fact, I'm an investor. And man, that company is growing because they have a great product and they care. Just last month, Vanta added a bunch of new features.

Starting point is 00:21:01 including AI-powered questionnaires that help you breeze through vendor security protocols and autonomous penetration testing baked directly into the platform. Twist listeners get $1,000 off by going to vanta.com slash twist. That's V-A-N-T-A dot com slash twist for $1,000 off. There's a clear parallel to commerce and e-commerce as well, right? So like 40 years ago, what people purchased, which is basically what was, produce and made locally. With the advent of e-commerce, Amazon One Click purchase,

Starting point is 00:21:37 now we all buy things online with a magnitude of volume and choices. Many things we don't even need. And I think there's got to be the same lens that applies here, which is when it comes to commerce or investing, you make things accessible. You do require a company to post fair information, but you've got to leave it to the general public to decide and participate in shaping the technological and the economy of the future that we're all going to be living in.

Starting point is 00:22:11 We could solve this, by the way, by just having greater disclosure requirements for private companies, but that's my own hobby horse. I won't bring that up today. Okay. Now, Kendrick, we have to talk about a couple things here. So there is Republic Note, which is a, quote, revenue sharing digital security that allows you to benefit from the economic upside of select Republic portfolio. companies. And you also have a tokenization business and a Web3 console business. And my question, before you kind of brought this up earlier, was going to be, are you eventually going to have a single infrastructure layer that would underpin or under GERD each kind of like pillar that we've

Starting point is 00:22:50 described at the business? Because it seems like you're doing many things that all kind of point in the same direction. Is that where the company's going? For sure, Alex, it's already that way. So from the technology and legal framework is one system that is the, you know, let's say to do for a company to be traded secondarily, you have to have primary issuance. When you deal with primary issue and you deal with some are non-accredited, some are accredited, some are institutional, some are non-U.S. they require different regulatory accommodation and frameworks to bring them in. So we, over the years, all of these things are part of the same operating system where in the early days we add on distinct business lies only to take in revenue. But the legal infrastructure and the technical infrastructure is one cohesive one

Starting point is 00:23:50 that underpins a wide array of industry, again, to use you know, poorly Amazon as an analogy. Once you have the e-payment and e-commerce infrastructure, you can sell books, you can sell grocery, you can do movie streaming, widely different sectors in exactly the same way. It's just that when it comes to finance, it takes a lot longer to build because it's much more regulated. But we are building that e-finance infrastructure similar to Amazon as an e-commerce infrastructure for that work. Kendrick, would a better analogy be you're building the AWS for investing? Like, you're building a base level of infrastructure to allow a variety of different things to happen

Starting point is 00:24:32 on top of it. Yes. No, but there are different components. I think when I think of AWS, I think just like storage. In this one case, when you talk about investing, primary issue and secondary trading on a cross-border basis, then you have to deal with banking, disbursement, all of these components that, That power Wall Street now is being redone pieces at a time by Republic and a few of the firms in the space. So yes, we're building one singular operating system, but we're not Elon, so we have to generate revenue and not just, you know, for 10 years.

Starting point is 00:25:13 But, yes, we're still in the very early days of about to launch, nothing that we've done today is meant to be, you know, the business. Republic, they all did a version of what we're about to launch ahead. Kendrick, an absolute pleasure. Republic.com is the URL, and we'll have you back on in six months or a year to see how far things are going, but I would not bet against you. Thanks for coming on. The pleasure is, oh my. Thank you so much, Alex.

Starting point is 00:25:38 I keep really looking for it to next time. All right, so we're going to need enormous data centers all over the world, the size of Manhattan, the size of Wyoming. They're going to eat all the power, take up all the space and drink up all the water, or maybe not. There are a couple companies out there that are working on different types of chips that are going to reduce the overall electricity demands of our future AI world. I'm very excited about this because I'm a big AI bull. And also, I live in a state where we're trying to expand green power.

Starting point is 00:26:08 And it's pretty tough. So to learn more about these chips and the companies behind them, please join me in welcoming the CEO of Positron, Mr. Mitesh Agarwal. Mattesh, how you doing? I'm very good. Thank you so much for having me here. I'm really excited to chat, actually. And I do have a comment about the power if you want. Oh, I'll never take an offer.

Starting point is 00:26:26 I'll never not take an offer. So go for it. Well, one thing that I want to clarify is as much as, and we'll go into Positron and how we are trying to be very power efficient, but I personally am a very big believer in we should generate as much power as we can. So there should be no restrictions on power generation. Just use that power that we're generating a lot better. So that's the comment.

Starting point is 00:26:47 So you're an all the above kind of guy. I present small module reactors, solar, wind. Okay, for fun. What's the most exotic form of power generation you favor in the next 10 years? I have one. I have one, too. Electricity generation on wave, like wave movements, ocean wave. Tidal power, essentially, right?

Starting point is 00:27:07 Title power, exactly. Okay. There's a company called Exotic, so that's why. Well, no, no, I appreciate that. I think exo watt is a company that I've talked to, and they're doing storing industrial heat via lens to solar power. And I was like, oh, wow. That's awesome.

Starting point is 00:27:24 Okay, that's a more exotic. The company that parked in my head was Pantalasa that is doing the tidal power thing, where they're deploying data centers in the middle of ocean and using the electricity. Okay, so free cooling, essentially, right? I mean, if you do data centers in space, it's harder. You have to radiate it in all that business. But the oceans are big. Here's my concern, though.

Starting point is 00:27:44 And we'll probably cut all this out of the episode, but who cares? If you put all the data centers in the ocean and we're dealing already with rising ocean temperatures, Are we are we just slowly boiling all of the remaining ice in the broader ocean world? No, the effect is very, very, very minimal compared to what the heat is. But yes, that is actually a really good question. I'm concerned because that will 100% come up in their investor. Luckily, I don't have to answer to that. No, no, no.

Starting point is 00:28:10 Okay, so Positron. Let's talk about it. So Positron is a company that's building basically digital brains to run transformer architecture to power large language models as we understand them. today. But to make people care about that, I think we need to talk about why GPUs are not always the best computing choice for AI workloads. And people will be surprised, Mattash, because Nvidia is worth $80 trillion today. And everyone cannot get their hands on enough GPUs. So why are GPUs not the BL-Indol for AI compute workloads? Yeah. So the first thing I'll start

Starting point is 00:28:44 with is GPUs are like, especially Nvidia GPUs are the major like over 90% deployment of compute today, right, across both training applications and inference applications. So training is when you're making the model learn and inference when you're generating, whatever you're generating, tokens, videos, images, right? And today training is the majority of the compute spent. So it's still well over 65% of the compute spend, inference is the 30, 35% remainder of the compute spend. Nvidia GPUs super well defined and designed initially for shaded processing back in like 10 years ago when AI was not as hot. But then really found out that for Matrix, matrix multiplication, which is similarly in that realm, and which is exactly what is used for

Starting point is 00:29:25 training application, absolutely phenomenal. You are generally flops bound, so the amount of transistors and compute bound. So the more you can push flops through, the more you can really do more training workloads and much more efficiently. And that's what Nvidia GPUs are phenomenal at, training applications. On inference side, though, inference is a much more nuanced or more nuanced beast in the terms of It depends not only just on the flops, but it depends on memory capacity and memory bandwidth. And at any given point of time, depending on how you're running the workloads, any of those three could be your bottlenecks.

Starting point is 00:30:01 So, Nvidia is really good at inference, but is it the most efficient at inference for certain types of workload? No. There can be always, you can always push the envelope just by existing silicon technology, whether that's by maximizing memory bandwidth or memory capacity on that. And that's where Positron steps in. And this is where we are coming in and we're saying for inference applications, especially the frontier inference application. So really large, like really in multi-tillion parameter model sizes, video generations, which require massive amounts of memory capacity and bandwidth, we can effectively really scale out our memory capacity and memory bandwidth through our architecture. And because of that, we can be very efficient at inference. The last thing I'll end here, where the reason why NVDA is not the most efficient at inference is because although NVIDIA's,

Starting point is 00:30:49 using the latest cutting-edge high bandwidth memory, what is called HBM in general terms, for the memory thing, which is the fastest memory bandwidth on theoretical specs that is available. Unfortunately, when you run inference workloads on Nvidia, you are not able to fully utilize the available theoretical bandwidth. If you want to run a successful company or maintain a profitable portfolio like I'm trying to do, you need reliable data. And without the most accurate and up-to-date information, you're flying blind. That's why you and your company need a partner like Alpha Sense. They're the

Starting point is 00:31:29 world's number one most trusted AI platform for market intelligence. That means they can help you and your team track the latest economic trends before your competitors even know they exist. Hey, listen, hear a twist. We eat our own dog food. We only pick advertisers and partners here on the show that we love their products. Alpha Sense has a massive searchable database, of public companies, and that lets us quickly dig up the most relevant facts and context, and their equity funding screener includes private companies, not just the public ones. They save us time, and they make us superhuman. So start your AI search and market intelligence journey today and get deeper insights to power

Starting point is 00:32:10 your business. We're even going to start you off with a free trial. How great is that? Alpha dash sense.com slash twist to get started. Make sure you use that URL so they know we sent you, and you can get this great deal. That's where I wanted to go with this because I did talk to the guys over at etched back in March and a really good chat learned a lot. I would say they're probably your closest competitor, but they were really big on how Nvidia GPUs don't end up using all of their compute

Starting point is 00:32:36 power when running inference workloads. They were like 30, 40% efficient. I presume it's the other bottlenecks that come into play there that you just mentioned. Yes, correct. Exactly. So because you're not able to, because of the way the silicon and memory architecture is laid out and basically how the movement of bytes happen, Nvidia is not effectively really having the same ratio of memory movement to the amount of computer available. So they have a lot more spec'd out on compute rather than the memory movement. So they end up consuming less than what is efficient.

Starting point is 00:33:06 And there are many software tricks that all of us try to do, whether that's spec decoding, flash attention, all those things. It can drive up the needle, but it goes up from 25% to 40%. It doesn't take it all the way. Well, yeah, that's just super inefficient. Okay, so let's get to what you guys have built to start, which is the Atlas system. It has eight Archer Accelerator cards. It has up to two terabytes of system memory.

Starting point is 00:33:29 It looks like a server rack. Tell me why it's awesome. Yeah. So the main reason why we really wanted to do the Atlas system was if you look at Silicon companies, you know, any Silicon company, any Silicon company that over the last 10 years, they've all gone down the route of, okay, we're going to design our ASIC or Custom Silicon and it takes them three, four, five years to get a product out in the market. What we did with Atlas system was we chose FPGA as our baseline product.

Starting point is 00:33:54 We knew that Atlas has limitations in terms of specs. Like it only has 32 gigabyte of HBM memory or the number of flops on it is like 130th of H100 GPU. But what we knew is that if we put FPGAs are programmable gate arrays. Like think of them as-field programmable gateway, if you will. Exactly, yeah. So it's basically the best way I can simply. is like a C of transistors, right?

Starting point is 00:34:17 And you can shape how you want to structure the CF transistor however you want it. So you can basically... It's the versatility that's so important in FGPS. Yeah, exactly. You can inflict your... FPGAs. FGAs, yeah. On to your...

Starting point is 00:34:29 You inflict your hardware architecture on top of the FPGA and really show that as a proof of concept that your architecture is worth something to the world and it works basically. So that's what we did with Atlas is the point was it allowed us to get from the out of the blocks quickly, get a product out in 18. months from the launch and get an Atlas system out, which is a typical 4-U server. So this is a very standard 4-U system with 8 GPUs. This has kind of been the standard so far until the GB300 and 200s came about in the AI

Starting point is 00:34:59 accelerator world. And you can stack a rack up them. It's very, very energy efficient. You know, each server is only 2 kilowatts for comparison in an H-100 GPU, sorry, H-100 system is 10 kilowatts of power. And what you can do is today with our Atlas system, and this is in production today. We're shipping it out en masse in thousands of quantities. Basically, you can run transformer architecture workload,

Starting point is 00:35:23 so all types of LLMs. And what we've shown with our architecture is we can drive the same amount of memory bandwidth that we were talking at 35, 40% for Nvidia. We can drive over 93% of available theoretical bandwidth and use that. So even with FPGAs, which are, as I said, limited cards in terms of specs, we can drive a comparable performance to H-100s. And because we consume a lot less power and they're a lot less cheap,

Starting point is 00:35:50 that they're cheaper than H-100s, we can really say that we are performance per dollar and performance per watt. We are two and a half, three, three and a half X better than H-100 depending on each of the inference workloads. And if you're curious why he's saying performance per watt also, it's because, as we talked about at the top of the show, we are often not just compute limited, but electricity limited.

Starting point is 00:36:09 So both vectors really, really matter. So I'm glad I'm glad you brought up the ASIC versus FPGA point because I was curious why you started there. I know your next system is going to be ASIC based. But you said that it's a good proof of concept to show that your approach works. So essentially the way that I'm reading them, I just correct me if I'm wrong, is that you had an idea and you applied it via FPGAs to start because it's faster. It kind of quick and dirty, get it up, show that it works.

Starting point is 00:36:36 And then you take the same overall concept, map that to an ASIC takes longer, but you'll have higher more performance technology down the road, but using the same overall principles that you proved out with Atlas and field programmable Gatories. Is that fair? That is very, very fair. And the big part there is you get immediate customer feedback. So we have the systems deployed in production right now. So people are testing their workloads as we understand. Like, so, for example, you know, when deep C came out, the attention mechanism changed from MHA to NHLA, we got that real-time feedback. How does that impact our architecture? How our architecture performs for that performance, rather than just doing some kind of emulation or simulation,

Starting point is 00:37:14 you have real world performance feedback and numbers that you get. So that was a big part of getting the product out quickly. And then the second part is, you know, you talk about like a lot of, all of us hardware companies talk a lot about hardware. The other big kind of feature is software. You have to be really, really good at it. It inference the pressure of being CUDA compatible or being very easy is a lot lesser than on the training side.

Starting point is 00:37:38 on training side, you cannot, you know, NVIDE absolutely, like, you know, you cannot be not in the Kura ecosystem. On inference, it's lesser so, especially for bigger workloads, but still, you want to make it as easy where, you know, as long as you don't make people change their code, that should be your baseline kind of targeted,

Starting point is 00:37:54 you don't want to make people change their code. And this allows us to not only test that out real time, we can literally import that same stack over on tour ASIC because the fundamental element or the compute unit and the architecture stays the same, What you are really boosting on the custom silicon are the massive amount of specs because you get to go to the latest process node. You get to do real new memory architecture and technology and then you can really drive memory capacity and bandwidth up. So that's kind of our goal.

Starting point is 00:38:21 And then lastly, I'll say to just one point out, it also helps with getting investments because you can really prove out your chip works to customers and so on. It also doesn't hurt that it drives a good chunk of revenue. So Atlas as a bridge over to Asimov, which will be your A6 system that comes. out, I think you guys said, 2006. Yes. How much does an Atlas cost? How many of them are you selling? And are they mostly like proof of concepts with customers that want Asimov and are waiting?

Starting point is 00:38:47 Or are these people that really want to use Atlas for what it is today to run inference workloads? Yeah. So we have now booked revenue. Oh, sorry. So when I said booked, it means it has not converted into revenue because we have to produce and and ship it to them. But we have now booked in tens of millions of dollars for Atlas systems. So as you said, gives us real revenue.

Starting point is 00:39:06 And the big part here is that the people that are buying today are basically two major things. One is they're buying because the system already performs and gives them enough of performance leverage for their use cases that they're thinking, okay, this is worthwhile investment to get my return on capital in 18, 24 months. That's kind of the time period. And then the second thing is, you know, people want optionality, people at least want to give a chance to non-NVD accelerators. And I think this is where they're like, okay, this also serves as a test for your company. Can you productionize a system? First thing. Second, is your architecture worth anything?

Starting point is 00:39:45 Like, you know, does it actually work and does it actually scale out? And the way you predict and simulate and it really drives. So what we call, like, maybe we're using the term. Maybe we are using the term wrongly. But like, we call it the wrong socket strategy. It's like, hey, you buy Atlas, you pay for, you know, we make money. We make revenue. many profits. And most importantly, with Atlas, we're not saying that we're going to scale out

Starting point is 00:40:11 revenue to hundreds of the millions or billions. I think it's unfair to expect that off of the Atlas system. But what we really want is the customer list that will spend that much money on the Asimov and Titan system. And that's kind of how we really approach it. So we call it the wrong socket kind of approach there. Well, no, I really appreciate it. It's always good to see a company that has a vision, not just for their first product, but their second and the third. I mostly prepped on Atlas and Asimov Titan. I was like, that's going to come later. We'll talk about that next year.

Starting point is 00:40:38 But I appreciate that. Now, the customer mix is interesting here because I'm curious if you guys are selling these into the hyperscalers or if your early customer base for Atlas is more like, I don't know, companies that want to stand up their own inference stack because they're tired of paying someone else's margin on their compute needs. Yeah. So the two public companies that we have,

Starting point is 00:40:57 so the two companies that we have publicly announced are Cloudflare and Parasale. So Cloudflare is a contribution. EFRACEL is an inference of a service provider. So kind of different use cases, but they both are looking at it from cost effectiveness and with Cloudflare power effectiveness, because they have these data centers in metropolitan cities, they can't supply more power to it or can't liquid cool it, right?

Starting point is 00:41:20 So they need an air cool set them how to drive more tokens from a given amount of power. So that's kind of where they get interested. In terms of other customer bases that we have not announced, it is a mix of a NeoCloud, a hyper-skiller to your point, and, you know, we'll announce that in due time in coming months as we get to it. So, Lambda or a core weave, and then in AWS or in Azure, just to put some names to the

Starting point is 00:41:44 categories that you're describing, I'm not saying there's the actual customers. Correct. To the category. To the categories. And then also, we're now starting to get some traction in the financial trading ecosystem, right? So that makes a long sense to me. Yeah. So those are the ecosystems that we are really going after.

Starting point is 00:42:00 And again, the goal is to get them really. excited about what it represents for Asimov, because where we are going with Asimov is somewhere no other chip will be. So if you draw on a map, like, you know, memory capacity, memory bandwidth, and the compute ability, the big claim to fame for Asimov for us is going to have two terabytes of memory capacity. So just to give you a comparison, Asimov is launching end of 26. And Vilia will be on the Rubin generation, which is the one after Blackwell.

Starting point is 00:42:31 So that's the next generation that they will. will be around starting to ship it out then. Rubin initially will have 384 gigabytes. So that's 0.38 terabytes of memory. We'll have 2 terabytes. So you're looking at a 5x memory capacity differential between Nvidia Rubin and Positron as an office. Okay.

Starting point is 00:42:52 At the top of the chat, you said currently AI workloads are like 65% train, 35% inference. Yes. I presume that was 9010 a few years back. Yeah, a couple of years. So how long until it's 50-50? So what I do know is what the rate of inference spend is growing. So this year, roughly $105 billion of compute spend, like not data center, just the compute

Starting point is 00:43:15 spend that goes in an inference data center. That will be spent on inference, $105 billion. It's estimated that it will reach $300 billion, so $250 billion in 2027 and roughly $350 to $400 billion in 2028. So that's kind of where it will grow to the inference compute spend. The training spend is already well over. you know, $350 billion today. And the thing is, if XAI keeps on increasing the colossus, if Open AI keeps on actually spending

Starting point is 00:43:42 the Stargate full-on project, anthropic spends with Trinium, I actually see the training spend as a training spend go up about a $500 billion to $600 billion mark by 2028. So you're looking at a trillion compute spend on training. So it might maintain that ratio of $6040 for the foreseeable future or in that kind of territory. but there will be an inflection point someday. I don't know whether it's three years, five, or ten years, but it will have that. That's very interesting because listening to you talk,

Starting point is 00:44:10 I was like, why isn't Nvidia trying, you know, getting off of their GPU train and having a second effort to just focus on inference because what you guys are building and what Etch is working on and a couple of other companies really seems to me like the right approach to handling inference compute

Starting point is 00:44:25 because you don't need to have the same stack that you use for training precisely. But if the training market is going to stay, the majority case for AI compute for the next couple of years. Never mind, I take it all back. Invita is barking up the right tree. But in 10 years, though,

Starting point is 00:44:39 we're not going to, I presume, we'll spend a lot more on AI inference than AI training. And so I think that long term, you guys are going to end up with the majority of the market. It just may be a little further out than I thought, I guess. Well, yeah. And look, two things. I think I'm going to be grounded about this

Starting point is 00:44:54 and actually not even humble just for the sake of being. I'm going to be very humble about it. And Vinduad not only is the most valuable company in the world, They are one of these smartest companies in the world. It's not like they do not understand this differential between where the training workloads and inference workloads and efficiencies are, right? What they're saying is there is no option in the market today that can even compete with the GPUs on inference. And they're absolutely spot on, right? Like, that is true.

Starting point is 00:45:18 Like that is actually a true statement today. And they're saying, so why divert our margin stack where they make 75% margins on this very high-end cards to like, you know, to create cheaper or more efficient inference? I'm not saying that they're not trying to make it more efficient. And then the second part, which is where the humbleness comes in is the proof is in the pudding. None of us or Esch or any other company for that matter. Literally zero other companies have proven out a better cost efficiency setup than Nvidia today in the market. Like we can claim for Atlas, for example, we can claim it for certain transformer models or certain types of transfer models. But if you look at the overall market, no one has come out and said, yeah, like, oh, everything, you know, we can beat Nvidia across the board.

Starting point is 00:46:00 because no one can. And even for the foreseeable future, that's not going to be the case. It's about finding the right niches, like for example, for Asimov, 2 terabyte of memory. What does that immediately get us? You can fit a Frontieran model, so like Brock 4 is expected to be $2.4 trillion. That's what the rumor mill suggests. You can fit that on a single chip of our card of Asimov, whereas you need like four or five GPUs to fit that weights of the model.

Starting point is 00:46:25 Or video generation. Today video generation is limited to like eight second or 10 second or 20 second clip. with that much memory, now you can generate a continuous video to a minute, two minute, you know, even multiple minutes long.

Starting point is 00:46:36 So you have to find your use cases where you stand out against NVIDIA and go for that. And this is also why NVIDIA wouldn't be like, hey,

Starting point is 00:46:43 we're going to create a single chip for every application because why are they going to sacrifice the margins for their own? But the cool thing is because NVIDIA is going after what is current

Starting point is 00:46:51 the majority case and the high margin case and the proven out case and the known to make a lot of money, but they're going to leave some gaps that you can step in. Absolutely. Yes.

Starting point is 00:46:58 So I want to ask about that because for fun. I looked up Nvidia's R&D spend for the last quarter and it was $4.3 billion.

Starting point is 00:47:06 Your last round was about $50 million. So about like, I don't know, 27 minutes of Nvidia's R&D spend. I know companies like Valor are backing you

Starting point is 00:47:15 but I'm curious are you guys going to need an order of magnitude more money to pursue the Asimov Vision and then putting them together

Starting point is 00:47:22 to make Titan or is it going to be more capital efficient than Nvidia's R&D spend might lead me to think to get you through your

Starting point is 00:47:28 2026, 2027-ish road that. Yeah, two things on that. First, an internal kind of introspection. We are very capital efficient. So we got our Atlas out with only a few single digit millions of money raised. And with Asimov, actually, and look, Valor is a well-known fund for their diligence. They looked at our capital spending plan. They like capital-efficient companies. And, you know, when they looked at it, and they saw that with the Series A, we could tape out our Asimov chip. So you are right, generally, or traditionally, if you look at Silicon companies,

Starting point is 00:48:03 they have raised $300, $400,000 to tape out a single chip. We are going into the market with a total raise so far of $75 million, including our seed and Series A. And we're saying that we're going to tape out our first generation ship in that $75 to $80 million kind of strength. And then to your point, after that, when it comes to production scaling, it doesn't mean that we are done raising.

Starting point is 00:48:23 I would be lying to you if I sit here that saying that, hey, like, we're not planning on raising more. We absolutely will raise more. but I think that will come for scaling off our asthma into Titan systems, as you said, and really going out to market. But, you know, we don't, like, personally, we have a very strong, personally, both my strong belief and the company, we have a very strong belief that, you know, you're only showing market validity and valuableness if you are both doing that with a very huge amount of capital

Starting point is 00:48:52 efficiency. And all, and your chip is only worth it if the market wants to pay for it. What I mean by that is what we really want is third parties and, you know, hyperscalers and cloud providers and companies buying our chips. What we don't want to do is like, just like, hey, build out our own cloud and then sell token as a service, right? Because that's kind of where the economics values, whether the companies are economically profit or not is not clear. So we really, like we want to do what an AMD does, which is sell to third parties, really. And then that's kind of where the vision goes.

Starting point is 00:49:23 And if you're signing the third parties, which means you're making revenue, which means that profit can, fund your future tapeouts. So that's the introspection side of things. With NVDA's 4.3 billion number, sorry, just quickly addressing that. Obviously, they're not just spending that on their silicon. Nvidia is a company that is just even, you know, it's not just a silicon company. I know that $200 billion worth of their revenue is driven by Silicon, but they're spending R&D on robotics, on networking.

Starting point is 00:49:53 Actually, one of the key things. Self-driving with their Thor system, et cetera. Yeah, and one of the things that I was, like, I genuinely think where their R&D expense has been really worth it, is looking at their at least projection of their optical networking and what they have the CPU, that they have projected at the GTC. It was really phenomenal to see what they're kind of scratching on the surface there, right? So they spend the money on a lot of things. So actually, again, I come back to it. I'm very both obviously like I'm very bullish about Positron, what it can achieve from market share perspective,

Starting point is 00:50:25 because the market is like going to be $300 billion. You know, we're going to achieve a big part of that inference market. Obviously, that's the goal. That's what we're trying for. But I also see kind of where Nvidia is spending R&D on different areas of domains of market. It's not just training. It's not just inference. You know, you talk to a self-driving, robotics, other things as well.

Starting point is 00:50:42 I guess what I'm trying to drive out is like, is there enough private capital in the market that is available to companies like Positron to do the work you need to do to earn your spot in one of these niches that we're describing? Like, does Valor and friends have enough gumption to give you the capital you need when you need it? Because there has been a little bit of a sentiment shift about AI in the last couple of months. I think GPT-5. I like GPT-5. Me too.

Starting point is 00:51:07 I love it. Less impressed. And so it makes me wonder about, like, how open the wallets are as we look into 2026. Yeah, I think the interesting thing is, like, you know, obviously if I speak about valor or, like, I have to give them kudos. And it's the same thing with Valor and Atreides, you know, Antonio, Gavin. And these guys, they backed un-fashionable thesis early with us, right? You know, like, you know, the fashionable thing is obviously, Envidia is going to keep on being the 95% of the market or 92% of the market.

Starting point is 00:51:36 But even if there the market is growing so rapidly that the rest of the market is still pretty large. But the second thing that I will say is the answer is absolutely unequivocally yes. Like, you know, even just like, you know, like if you look at Valor and Treatise and DFJ, who are backers for Series A, generally these are growth. They generally back companies in Series B, Series C, like when you're raising growth rounds. They came in early in Topositron. And that's not because they just fashioned one day that we want to be early stage investor. I think it's because they basically decided that, hey, this company, if it actually proves

Starting point is 00:52:11 and delivers and executes on what they're showing and promising is actually going to be on a very fast growth curve. And we're going to be there to back them up. Because, again, that's where their actually main capital comes in. is especially Valor, if you look at Antonio's thesis, right, he doubles down every single time on a company once they show execution and success. So we have no doubts about that, both on our ability to prove it out to the market and that there's capital available for us to keep on growing and really finding a very, very good

Starting point is 00:52:45 revenue stream for us. Okay, one last question before I let you go. And it's just about the durability of transformer-based AI. Like, you guys clearly have made a directional bet on Transformers staying at the forefront of AI progress. And as I alluded to a second ago, some people are a little bit worried about training walls or, you know, token price is not coming down as we burn more of them in reasoning models. And there was that MIT study that everyone, I think, kind of misread, but took to the fences. So tell me a little bit about your just your and Palsodon's confidence in Transformers staying at the heart of AI. And hopefully, as well, that there's still good performance gains.

Starting point is 00:53:23 to be had out of transformed-based LLMs in the future. Yeah. Can I say one thing to begin with in terms of clarification, though? Like, we are fundamentally a linear algebra accelerator, right? What we are accelerating is the matrix multiplication. And so, and, you know, if you look at back 30 years ago, and I think if you look forward 30 years ago, the fundamentals of deep learning will be in the matrix multiplication. So linear algebra will still be the driver of the different, whether it's transformer architecture,

Starting point is 00:53:51 whether it was CNN, and whether it will be what a little. of the future architecture of the world is. Like I would... Are you working towards saying that I am being overly specific by saying Transformer, but really what I'm talking about is just matrix multiplication? You are being, I'm saying you are being overly specific that our marketing word is, yes, we focus on Transformers because that's where the use cases are. And that's where we actually really stand out as well.

Starting point is 00:54:13 But if tomorrow something changes, like, you know, a new model architecture comes out. For us, we are not actually specifically tied to Transformer. What we can do, we are, you know, just like GPUs can adopt a new architecture, we absolutely can adopt the new architecture. It'll just be we have to, we'll have to spend more time on instruction sets and, you know, software data set and stuff. But that's a software point, not a hardware point. Correct.

Starting point is 00:54:37 Yeah. From, from an architectural perspective, we can actually serve much more wider. Like, even today with our Asimov and with our FPGAs, we can serve diffusion, we can serve other types of models. It's via focus on transformer because that's where, you know, over 90%. of generation or inference use cases today lies. And also, that's where people are really deploying it. So from our side, we're making some decisions to optimize for it. Like, for example, big transformer models require lots of memory.

Starting point is 00:55:03 So we are going and chasing after having a lot of memory on our chip rather than... So you can run those two trillion-per-mendor models. Yeah, exactly. On a single chip, exactly, right? But from an architectural perspective, we can actually run much more broader ecosystem of models for inference, not training. well um normally i i end these chats with uh well thanks for coming on where can people find you online and what's a job you're hiring for that you're having a hard time filling but instead i'm going to ask you this in my touch um can i give you five dollars in your next round absolutely yes

Starting point is 00:55:34 to be clear i'm kidding i'm just i'm very impressed and i'm also i'm also very i'm still a big i i'm still a big ai bull yeah and i think that when i think about the asmov's you know system being put into like groups of eight and Solis Titan and so forth. It's all, it just strikes me that we're going to have a future that's going to be a little bit more efficient than people think. We're not going to boil all the oceans. We're still going to have power for our appliances. And we're going to have essentially infinite intelligence in our pockets and all of our screens.

Starting point is 00:56:01 And to me, that's a pretty exciting future. And we better. We will need to have infinite intelligence. And that's the whole goal. Like if you ask Nvidia, if you ask us, if you ask any other Silicon company, I think they're all trying to say that, you know, we are, we are approaching it with some different approaches, but basically it's like, how can we bring cost per token or cost per generation down so that we can afford to have infinite intelligence while increasing our energy production

Starting point is 00:56:27 as much as we can? Well, I'm totally here for it. I appreciate it so much. It's positron. com. If people want to go take a look at it and in Mattes. Yes, please. When Asimov comes out, will you come back on and tell us how it's going?

Starting point is 00:56:37 I absolutely. I would be the happiest person to come back on because that would actually put us into an untouched territory, as I said, in terms of a couple of the aspects that we're going. you're targeting there. Late 2026, you said, right? Correct. We are taping out Q3 end of 2026 and then production. So initial sample systems will be late 2026 and production scaling in early 27.

Starting point is 00:56:58 That's kind of the schedule. Well, that means you have quite a lot of homework to do. I'll see you in 365 days. Thank you so much. Thanks for coming on, man. My pleasure. Thank you so much for having me.

This Week in Startups - Republic opens private markets as Positron takes on GPUs: A TWiST500 doubleheader! | E2176

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.