This Week in Startups - Vertical AI Innovations: Abacus LLM in Finance & Synthesia's Video Avatars | E2112

Episode Date: April 15, 2025

Today’s show: Alex interviews founders from Abacus and Synthesia—two companies leading the charge in vertical AI. Abacus is deploying on-prem LLMs to banks and insurers with a focus on data contro...l, accuracy, and integration with legacy infrastructure. Synthesia just hit $100M ARR by helping 70% of the Fortune 100 turn internal documents into scalable AI video. If you're interested in how generative AI is moving from hype to real enterprise utility, this one's for you.Timestamps:(0:00) Introduction to vertical AI with Abacus and Synthesia(1:57) Real-world applications of AI in regulated industries & Interview with Abacus founder David Moscatelli(5:02) Unique challenges for banks and credit unions using AI & Abacus OS private LLM(9:56) Atlassian - Head to https://www.atlassian.com/startups/twist to see if you qualify for 50 free seats for 12 months.(11:57) Abacus features, benefits, and customer acquisition strategies(12:27) Decentralized indexer, data integration, and voice AI technology with VAPI(19:58) Vapi - Go to vapi.ai/twist and get 1000 minutes free per month - for life.(24:47) On-premise setup, combating AI hallucinations, and trust in AI responses(30:08) Hubspot for Startups - Visit hubspot.com/startups and join the founders who are turning growth challenges into opportunities.(33:40) Abacus company growth, fundraising, and sales hiring(38:25) Introduction to Synthesia CEO Victor Riparbelli, AI avatars, and building a startup in Europe(41:34) Synthesia’s market positioning, target use cases, and personalized avatars(46:33) Synthesia's training data, partnerships, and improvements in AI avatar technology(55:57) Synthesia’s growth, Fortune 100 companies usage, and financial aspects(1:00:21) AI model competition, market appetite, and enterprise focus(1:02:09) Future of AI in video creation, ethicalconsiderations, and Synthesia's plansSubscribe to the TWiST500 newsletter: https://ticker.thisweekinstartups.comCheck out the TWIST500: https://www.twist500.comSubscribe to This Week in Startups on Apple: https://rb.gy/v19fcpCheck out:Abacus: https://www.goabacus.co/Synthesia: https://www.synthesia.io/Follow Alex:X: https://x.com/alexLinkedIn: ⁠https://www.linkedin.com/in/alexwilhelmFollow Jason:X: https://twitter.com/JasonLinkedIn: https://www.linkedin.com/in/jasoncalacanisThank you to our partners:(9:56) Atlassian - Head to https://www.atlassian.com/startups/twist to see if you qualify for 50 free seats for 12 months.(19:58) Vapi - Go to vapi.ai/twist and get 1000 minutes free per month - for life.(30:08) Hubspot for Startups - Visit hubspot.com/startups and join the founders who are turning growth challenges into opportunities.Great TWIST interviews: Will Guidara, Eoghan McCabe, Steve Huffman, Brian Chesky, Bob Moesta, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarlandCheck out Jason’s suite of newsletters: https://substack.com/@calacanisFollow TWiST:Twitter: https://twitter.com/TWiStartupsYouTube: https://www.youtube.com/thisweekinInstagram: https://www.instagram.com/thisweekinstartupsTikTok: https://www.tiktok.com/@thisweekinstartupsSubstack: https://twistartups.substack.comSubscribe to the Founder University Podcast: https://www.youtube.com/@founderuniversity1916

Transcript
Discussion (0)
Starting point is 00:00:00 Hey, everybody. Welcome back to Twist. This is Alex and I have two amazing interviews for you on this very fine Tuesday. Thematically, they are linked because we are talking about the application of AI to particular sectors. This is often called vertical AI, kind of building off the idea of vertical SaaS, but taking generative AI tools and applying them to one particular industry. First up, we're going to talk to Abacus. Now, this company wants to bring Gen AI into regulated industries, think credit unions, banks, and insurance companies. It's very interesting to hear how they're going about approaching that market, and it's quite different than other AI-first companies and how they're going after their own markets, especially if they're more consumer-oriented.
Starting point is 00:00:40 Then second, we're going to talk to Synthesia, a company that I actually demoed on the show a couple months back. They're a Twist 500 company, and they're working on basically video-generated AI models of people for things like trainee videos and so forth. Well, they have a pretty important revenue milestone to talk about and some very cool technology, and they have notes on where people are buying Gen A technology inside the enterprise today. So if you're a little tired about hearing about the newest model this
Starting point is 00:01:07 and the newest benchmark that, I want to know where does the rubber meet the road? Well, here you go. Let's start with Apicus. This weekend startups is brought to you by Atlassian. From MVP to IPO, Atlassian for startups provides your team the right tools to plan, track, and collaborate on work.
Starting point is 00:01:26 Head to atlassian.com slash startup slash twist to see if you qualify for 50 free seats for 12 months. VAPI. Add real-time AI-powered voice conversations to your apps or business in minutes, not months. Go to vappi.ai slash twist and get a thousand minutes free per month for life. And HubSpot for startups. Smart founders aren't piecing together random tools. HubSpot is the customer platform that thousands of startups use to scale efficiently. Get up to 75% off plus three months of perplexity AI for free. Go to HubSpot.com slash startups.
Starting point is 00:01:56 Go back in time, a couple of quarters, maybe a year, a year and a half, everyone was curious. What are all these AI technologies going to be used for? Will they have a real-world application? Are they just very fancy toys that cost a lot of money to run? Well, the market has answered that. The answer is yes, AI models do have a lot of value. Bringing AI to the enterprise does have real grit to it, but some companies are taking this in a different direction.
Starting point is 00:02:20 Instead of building a tool or a model that works for everybody, they're going niche and trying to take one industry down to the studs to build something just for it. One of these companies is Abacus, which you can find at goabacus.com. We're going to talk to founder David Mosquitelli about what he's building and why he decided to go after a regulated industry so hardcore that I think he must be at least half insane to have selected it. Please welcome to the show. It's David.
Starting point is 00:02:45 David, how you doing? How's it going, Alex? I am part insane. That is true. So I appreciate the intro. I don't have to surprise anyone. We were just talking before we hit record. and I learned that you're based in Chicago,
Starting point is 00:02:55 and I have done for Chicago winters, and I have gone through snowpocalypse. I have lived the lake effect. I learned about depression. So just give Chicago a shout out for me and tell me why you're building in my beloved windy city. Yeah, Chicago's great. I've tried to ask my parents tactfully before,
Starting point is 00:03:15 why do we live here? Why, of all places, why did you pick this? Then they've never been able to give me a good answer. But in Chicago, we're building a company called Abacus. So Abacus is generative AI for regulated industry. Think Bank, credit unions, insurance companies. And what we like to say is Abacus is where AI meets Assurance. So we're really about helping enterprises not only have an LLM solution with our on-prem solution,
Starting point is 00:03:41 but also helping them with response control and then anthracillusionation. So we have a whole package platform for enterprises, and we're doing quite well at that. from a relatively high level, instead of making a wrapper around something that open AI built or anthropic or pick a company, you guys have your own model that you're taking to the financial industry, for lack of a better broad term, which is very much regulated. And also, in my personal view, just not hyper technology savvy. Like, I mean, we joke about checkbooks. Well, who was sending those out still, right? So am I wrong to think that the world of financial services is relatively behind the times because it feels like you picked a difficult niche.
Starting point is 00:04:22 I joked in my intro, but I'm honestly quite curious about their posture towards adopting, you know, technology let alone generative AI on the leading nut. Yeah, so it's a good question that you ask. And, you know, banks, I will say that the first bank I ever worked at is a true story. They gave us a package of red grease pencils to mark up bank statements to do reconciliations. True story. So you're not completely wrong when you say banks are sort of behind the technology curve. But I will say they've started to recognize this is a real serious need for their business,
Starting point is 00:04:52 for their customers, one that they can't avoid. But they want to do so in a bank-like way. What does that mean? Very little risk, lots of control, right? Something that they can really have a lot of purview over. And so that's why Advocas has kind of come up as a solution for them. And they're really looking for, if you think about it, Alex, looking for three things when they want an LLM or any kind of piece of software.
Starting point is 00:05:15 They want something that they don't have to worry about their data. So they want something on-prem or very secure. They need something that's going to index all of their data, right? So if you're a bank or a credit union or insurance company, you know, they don't have one place where all their documents and data are, right? They have tens of thousands of documents and they could have hundreds of different data sources, right? You know, how do they bring all that together to index for an LLM, right? So they need that.
Starting point is 00:05:39 The second is the answer has got to be accurate, right? You can't ask an assistant what's our mortgage rate and have the assistant give the wrong rate. now you've got a problem, right? It creates all sorts of compliance risk, right? Yeah. And then you need response control, right? So, Avocis or any virtual assistant like software might give a response, but maybe that's not the response they want that piece of software to give.
Starting point is 00:06:00 So they need some way to control the responses whenever they feel appropriate. So those three things, you know, in place, as I've described them, I think banks are more willing to explore this avenue as a potential solution for not just internally, but their customer. You know, we ask a lot of companies that jump on the show, you know, what's your moat? And it sounds like really in this case, getting a product set up to work inside of an industry with those very strict regulations is a moat in of itself because who wants to go do all that work? It sounds very, very complicated. But instead
Starting point is 00:06:32 of going product to the model, let's start the route and then build up. So you guys have made something called the Abacus OS private LLM. It's a 30 billion parameter model. And I'm not going to David, when I hear we trained a model, I just imagine a crater in the ground with just cash burning coming out of it. Because that's the narrative we hear, right, from a lot of folks. I know you can do it cheaper. Data bricks has, et cetera. But tell me about this model you trained, how you went about it, and just the cost books. I'm sure a lot of founders listening are curious about the process and the why. Yeah, so we started with a base open source model, which is the Mosaic 30 billion brand model, right? Sure.
Starting point is 00:07:10 And then we have a lot of data. So from all of our clients, right, we have about six million queries every single month that come in. So we have a ton of data in terms of the questions asked by our clients. And this data is very specific, right? So folks ask, hey, how do I open a new debit card in DNA? Right. If you ask any general LN that question, they're going to think DNA means the genetic structure. Really, if you're a banker or credit union, you know DNA stands for FISA, right?
Starting point is 00:07:36 Things like that that we're able to extract out and then fine tune and layer on top of that base model. some really impressive enhancements, right? The second thing we do is at Abacus, we have what we call a parent model. It's a very large, very bloated model. This is a lot of different things. And then what we deploy on the bank's infrastructure, credit union insurance infrastructure,
Starting point is 00:07:57 is called a sister model. And the parent model teaches the sister model how to behave, right? So the sister model mimics the parent's behavior. This is often referred to as knowledge distillation. Okay, I was curious. I was like, wait, this sounds incredibly familiar. So you made the enormously expensive, ridiculous, oversized, hard-to-use thing,
Starting point is 00:08:17 and then simply let that train the smaller model, which is cheaper, faster, easier to run, and therefore is a better fit for a GPU cluster you might find at a bank versus one at, say, an Azure data system. Exactly right. So that means all of the compute that the on-prem models eating up. The cost is actually not even material, right? It's so small because if you think about it, I hate to deemist divide this for you. But an LLM model is just.
Starting point is 00:08:41 the CSV file with weights, right? To get those weights, that's where the expensive training comes in. And then there's some software to then run those weights, predict the next series of words and assess to get the query, right? So, you know, at the end of the day, that's what we need to run when we're deploying. And so that sister model is very lightweight. It's completely on-prem within their control, you know, and so that gives us the flexibility to then have that within their infrastructure.
Starting point is 00:09:05 This allows them to do two things as far as I can tell. One is you have an enterprise search function, essentially an AI-powered search product, lets people go into their own data and pull things out. Some companies are working on this. I think Glean is doing this for like the generic enterprise. I've actually used that once at a company that bought it and it was medium. Maybe it's gotten better. Sorry to Glean. That wasn't very nice. But I was not blown away by it. And then the other thing you're building is a thing called Abby ABBI, which is an AI agent as far as I can tell that you have tuned to work in a Fincer call center-like environment.
Starting point is 00:09:43 Exactly right. So if you think about Abby, and our company is called Abacus, but some of the first clients, they kept referring to Abacus as Abbey. And so we sort of adopted that name Abby, so we call it Abby Assist. All right, if you're shipping a product
Starting point is 00:09:58 or rolling out an update, you're building a company, you need to be organized, right? We know that. Atlassian has exactly what you need to streamline your work and smash her goals. And the Atlassian for Startups program is packed with all the tools you need, like Jira, where you can track every task sprint and bug. That's the industry standard. Confluence, another industry standard for team collaboration and documentation.
Starting point is 00:10:20 And of course, Lume for quick video explainer creation. Now, Lume is really brilliant. My team started using Lume on their own. They started paying for it on their own. Why? They wanted to get credit on the investment team for communicating to me, the general partner of the firm, why they wanted to invest in a company. So they would do a loom where they recorded over a recording of an interview they did with
Starting point is 00:10:44 a founder or visiting their website and going through why they want to invest in a company. And this was so great for me. I would be skiing in Japan. I would be on a flight to New York to see my parents. And all of a sudden I get a notification for one of my team members. Hey, watch this loom. And I get the link for the loom. I click it.
Starting point is 00:11:00 And then I can put comments at any time. So it's like doing a conference call. but on my time asynchronously, and I can communicate right there on the video. Also included in Atlassian for startups is Compass, Gera Product Discovery, BitBucket, so much more, all powered by Atlassian intelligence. That's their built-in AI.
Starting point is 00:11:19 Atlassian software helps companies like Canva, Cloudflare, and Rivian, keep growing and keep innovating. Whether you're brainstorming on sticky notes or scaling to the big leads, Atlassian is here to accelerate your startups growth. Check out Elassian for startups where eligible startups get, up to 50 seats for free for one full year.
Starting point is 00:11:38 That is absurdly generous. Why can they be so generous at Atlassian? Because they're the standard. Atlassian is the standard. And they are generous to startups because they were once a startup. I remember meeting them 20 years ago in Australia. What a great company. Head to atlacian.com slash startups slash twist for complete details.
Starting point is 00:11:57 And Aby assists is sort of that front end chat, BT like interface that bank, credit unions, insurers use to answer questions. And this is the kind of thing that, you know, if you're thinking about servicing a client or a customer, you're on the phone with a customer, you're in front of a customer and you're trying to get information to them. So there's the Abbey agent, which, you know, allows folks internally look up information. And Abby agent has a whole bunch of things, you know, sync to it. So they have compliance guard. Then we have chain of validation. We could talk about those.
Starting point is 00:12:28 But then the piece that you mentioned is really important is the indexer. So Alex, I spend eight months of my life. building software that just connect one piece of data to the next. It's very painful. I don't want to relive that, but that was my life for a month. And so we connect to a lot of data sources, but what's really important is we connect to the really hard one. So you mentioned Glean. Glein's a great company. You know, they connect to a bunch of different data sources, but the ones we have to connect to you for our industry are things like FISA and CIMATR.
Starting point is 00:12:55 Very complicated core systems that require due diligence and compliance checks in our end. And I presume, unlike linking into Slack, there's not, an amazing API perfectly built just for you to drive developer adoption. Exactly right. So we helped to solve that problem, safety, and security while connecting those core systems. The piece about our indexer that's really important is it's decentralized, right? So most of our competitors, Alex, are going to say, hey, we have this AI assistant, take all of your documents and upload it to this one location and we'll index it and then
Starting point is 00:13:26 we'll give it the answer. And if you're an enterprise, that's just not realistic, Alex, right? So what's unique about our indexer is that it goes out where the information lives today, indexes that data and brings it together. So there's no internal alignment meetings or changes and processes and procedures. That makes sense? Yeah, but the index, you're just to make sure that I'm fully tracking here,
Starting point is 00:13:44 because we're talking about on-prem is running inside of the bank, credit union or insurance companies own systems. So you hand them this piece of software and then they go off and run it on, okay, this is making sense to me. But it all stays inside. It never breaks the firewall containment. That's right. The indexer is completely inside their infrastructure. The on-prem model is on-prem completely in their infrastructure. So they really have that piece of fine and sense of security when using Abacus the product. Yeah. Now, when I was prepping for this chat, because I love to dig into how people actually run their service, you have a section in the Abacus OS LLM model part of the site discussing how you can run it on a handful of H-100 GPUs or I think also A-100s, yes.
Starting point is 00:14:28 Yep. Reading this, talking mostly to technology founders, getting access to a small allocation of GPUs is not an impossibility. Yep. And the know-how to set them up and properly use them, also not an impossibility. When I think to my childhood credit union, which had great lollipops, by the way,
Starting point is 00:14:47 love that. Still remember that. I remember where it was my hometown for some reason. I don't think they have the juice to get a GPU rack going. So my concern is this sounds awesome, but it also sounds like something that only applies to the largest banks, credit unions and insurance companies. Because to me, smaller people probably just don't have the gear for it. Is that right? Or am I being too pessimistic and behind the times? Alex, this is why I love you because your questions are right on. So your answer is correct, but I want you to inverse it. So if you're a big bank, think Chase or Wolf Bargo, you fill in the blank, right? those guys have the money, the talent, the infrastructure to go ahead and then get the best
Starting point is 00:15:29 infrastructure that the money can buy. But if you're a credit union or a mid-tier bank, you don't have the talent or the infrastructure to do it. That's why they need a solution like Abacus, right? So even if they, you know, figure it out a way to do it, it's not a simple plug-in-play, press a button that deploys to their infrastructure and they're good to go, right? There's a whole bunch of maintenance and compliance things around that. So you're absolutely spot on about, you know, there is a lift, a technological lift to get people there.
Starting point is 00:16:00 And that's why, you know, a lot of the folks, mid-tier credit unions and banks that need that support. That's why they turn to abacus. What is the deposit base of a mid-tier credit union? I have no idea what the tick marks are on the axis we're describing here. Yeah, another fantastic question. So I would say the sweet spot is between a billion and $3 billion in total assets, which if you're at JPMorgan, Wells Fargo, it probably sounds adorable.
Starting point is 00:16:26 Yeah. Right. Oh, there's a little baby bang. Yeah. But those are the folks that really, they have the money, but they also, you know, they have the infrastructure, you know, to think about doing something like this, but they don't have necessarily the technological capability or the staff or human capital
Starting point is 00:16:43 go ahead and deploy it. The net interest margin on, say, $3 billion, let's call it, it's a credit union. So it's a little bit lower because they're better to you than banks are. call it 2%, what is that 60, 60 million a year. So they probably have like a million dollar a year IT budget, give or take. So that does put a cap on your ACV for that type of client, but it still look a very attractive business. So how hard is it to land those mid-tier credit unions and banks and bring them into the Abacus fold?
Starting point is 00:17:11 Or sorry, the Abbey fold. Yeah. So it's a good question. And this is where if you haven't already, Alex, you're going to be like, well, this guy's insane. So I learned early on that, you know, if you're a customer of these banks, You know, because people always ask me, David, how did you get these clients? How did you get your clients?
Starting point is 00:17:25 This is a very hard industry. Did you open accounts at all of them? You are, there you go. So I'm probably on some, I'm probably on some government list, right? I'm being watched by the feds because I probably have the most open bank accounts of any person in the U.S. So I would open a bank account, deposit the $5.
Starting point is 00:17:41 And I'd email the CEO and I'd say, hey, I'm a customer. And I promise you, if you're a founder listening, that tactic works like 95% of the time. If you email them and say you're a vendor, you almost never get a response because they get those emails all day long. But if you say I'm a customer, it's like a 95% response rate. So that's how I would sort of get my foot in the door, which I know sounds insane and it is
Starting point is 00:18:01 insane. I don't recommend it for everyone. But that's kind of my hacky way of going about it. And then the second way, Alex, that we get them to use abacus is, you know, I tell them, and I'm going somewhere with this, I don't want them to use abacus unless they absolutely love it, right? I want them to completely love the product. And I wanted to do everything that we've told them it does.
Starting point is 00:18:22 So we give them three months, money back guarantee you, right? We let them try the product. You know, it's not something I want them to use. If it's not something they absolutely love to use and it's, you know, we take this as a matter of personal pride, company pride, you know. We want our products to be tremendously useful in, of tremendous quality. So, you know, that's how, you know, we're able to sell ourselves. It's not just, hey, you know, we're a customer of yours or we have a strong interest
Starting point is 00:18:47 in you doing well. But, you know, our product is here to help you and we're going to guarantee it. It does that. From a business perspective, I can kind of see this two ways. Because on one hand, if you offer a three-month money-back guarantee, people will take a greater risk on trying you out because the bar to entry is effectively lowered. Right. And also, you, by giving them this out, effectively say, we're going to let all of our problem customers go away immediately. And if we're not a fit, we're not going to try to fight for retention here because why bother?
Starting point is 00:19:15 On the other hand, a lot of companies don't like to let revenue go away. So I guess for founders out there, this sounds like a short-term risk, long-term benefit setup. Is that fair? Exactly right. So what I like to tell all of the avicrous folks, everyone that we work with and all of our employees is the answer is never no. It's just not now. Yeah. We are here to serve our clients and our customers.
Starting point is 00:19:40 And that's our focus is making sure we're delivering a product that is really what they want. And while it seems like an inherent risk, gosh, you're going to give away three months for free, you know, I think that's the ultimate test. If your product is what you've said it is and it's a quality piece of software, I don't think you have anything to worry about giving the three months. 2025 is going to be the year that voice AI really breaks out. Voice AI is going to change everything. And this technology is perfect for building a chatbot or automating customer calls and basically creating these next gen voice experiences. But building with these tools is so difficult. It's just a bear. It's going to take your developers months to
Starting point is 00:20:20 build and test anything even basic unless you use VAPI. VAPI. This is the fastest and easiest way to integrate voice into your products. It's just the most simple API you've ever used. It helps you build, test, and deploy voice agents in just minutes. And you can create ultra-realistic AI voices and customizable personalities. I like that part, all with real-time response speeds. You know, it'd be great. Maybe I just demo it. Okay? Listen to this. Hey, Jason, it's great to be on your podcast. It seems like VAPI has figured out how to be very low latency. Maybe you could talk a little bit about why that's important. For sure. Low latency is crucial for a smooth user experience, right? It makes conversations feel more natural and engaging. People expect real-time interactions, especially with the voice assistants.
Starting point is 00:21:08 At VAPI, we've optimized everything to keep that response time around one second. It's like having a chat with a friend, not waiting for a robot. to catch up. Can you do other voices and personalities? Right now, I'm rocking my CEO's voice, but with VAPI, developers can bring their own voice models. So if they want to add some walk and flair or any other personality, they totally can. I mean, who wouldn't want a little drama in their AI conversations? Just think of me as your friendly AI buddy, keeping it real. If you want to give it a spin yourself, and trust me, you should. Just head over vappy.a.a. slash twist. And Twist listeners will get
Starting point is 00:21:42 1,000 free minutes per month for life. That's F-R-E. Free. What an amazing price. Again, that's VAPI. dot AI slash twist. We've never had a customer
Starting point is 00:21:52 take that three months and not convert to a full-time paying customer. So I know it's adorable because we have nine paying enterprise clients, so that sounds adorable. But, you know,
Starting point is 00:22:02 we, you know, for us, we've never had a conversion not go well. Nine enterprise customers for a company of the age of advocates because it's only a couple years old, right? That's right. That's right.
Starting point is 00:22:13 Yeah. So, no, stop, stop talking yourself down. That's fantastic. There's unicorns out there right now who are like, we would kill for nine customers. So I think you're doing fantastic. Now, what is the ACV here? We're talking about, you know, FinServe. We're talking about self-hosting, talking about data fine-tuning. There might be some handholding there. I'm presuming this is not 20 bucks a month. No. So, you know, what's our pricing schedule look like, how do we price our product, right? We have both a monthly subscription and a one-time installation fee. Now, it's based on asset. So our monthly fee is for anywhere from 10 to 15,000 per month. And then the one-time installation is 250 to 350,000. It's a one-time payment.
Starting point is 00:22:56 And it's usually amortized over the like contract. Average contract is roughly three years. Okay, so I'm looking at about 100 a year for the spread out installation cost and that I'm paying some number of bips off AUM for the rest of it. That's right. That's right. So we usually get the installation fee. It depends on the client. Sometimes it's paid up front. Sometimes it's amortized.
Starting point is 00:23:18 But I would say 15,000 is surprisingly more common than I mentioned that range. We normally inch toward the 15,000 per month. How much capital would you need to raise to be able to just waive the installation fee? and just go straight for it and just go faster and land even more enterprise accounts by getting rid of that roadblock. Everyone has to be this question. So the installation fee is actually... Okay, tell me why. I want you to think of the monthly fee as mom and the setup one-time installation fee is dad.
Starting point is 00:23:55 Dad's grumpy and he's really strict and mom is really nice and wants to give you everything she can, right? So you play those off of each other. It allows us to be flexible with our pricing and allows us to the... negotiate, right? So if you have two the numbers, one's a monthly fee and the other is a really high numbers, seems, you know, very large. And all of a sudden, you bring that number way down. All of a sudden, the customer, right, sees a tremendous amount of value and goodwill toward them as a client, right? So it's a very strategic value proposition to have both, if you just have one number, hey, this is our monthly fee, and you only have one number to negotiate with. But if you have
Starting point is 00:24:30 two, you can play them off each other. And we typically do. So everyone says that to me, hey, why the installation fee that's really high, my answer to that is it gives us a leverage and negotiating power when we're talking to clients and allows us to, you know, meet their level of budgetary applications, if that makes up. Yeah. And besides, you care much more about the monthly than the one time. Exactly. The one time is a sweetener for you guys.
Starting point is 00:24:54 There you go, Alex. So it's really, we're about the monthly. So the one time gives us flexibility in our pricing in a way that we wouldn't have otherwise. Honestly, I mean, like pigs get fat, hogs get slaughtered. If you can go ahead and get paid twice, I mean, look, I'm not going to stare gross profit in the face and say no to it. But I just realized something interesting about your 15K a month price point. Because this is on-prem, you're not eating enormous public infarclad costs.
Starting point is 00:25:26 So your cogs must be low and your gross margin must be just ludicrously lovely. Exactly right. So we are not paying an open AI or anything on a per query basis, which is why our fees can be fixed, right? We can say, okay, for the contract, 10,000 or 12,000 or 15,000 a month. Banks and credit unions like that. They do not like annuities that are variable. They will shut that down every time.
Starting point is 00:25:47 They want a fixed price. And so they can ask 10,000 queries a month or a million queries a month. It doesn't matter, right? And the price never changes. So we can undercut a lot of our competition on price, first of all, when it comes to these sorts of Asians and AI deployments. And then secondly, as you mentioned, you know, the one-time fee, that we're charging, you know, because everything's on-prem, everything is built in-house.
Starting point is 00:26:10 So, you know, we can really price it in a way that makes sense for the client. Has anyone ever come to you and said, listen, we want you, we want to be on-prem, we don't know how to set up our own, you know, racks. Have you ever, like, gotten the screwdriver out and, like, put the GPUs together for someone else to run the model on their own metal? Yeah, it's funny you mentioned that. So a lot of credit unions and banks, maybe even the one you mentioned top of the call from your hometown, they actually have a server room.
Starting point is 00:26:35 right, actually in the building, right? And so we've actually gone to physical server rooms before, spent a few days helping them install abacus. So maybe not quite a screwdriver, but we have been in the room and the vicinity of the actual servers that run the computational data for the bank of the credit union. So we're not afraid to be there on site if we have to. I'm now friendically Googling Credit Union's Corvallis, Oregon,
Starting point is 00:26:59 trying to figure out which one was. Yeah, but it's the one over by my old dentist. You know the place, right? You've been there. Of course. Yeah, 100%. I want to drill down on some of the tech things and get away from the money side for a minute. Response control, hallucination control.
Starting point is 00:27:12 Clearly, when you're talking about regulated industries, can't be spitting out BS. How did you guys actually go about combating hallucinations? Because I think that most models have made progress here. I think there's a good trajectory. But I wouldn't say it's something that's been resolved. And so I'm kind of curious, what was your approach? Yeah. So we have what we call chain of validation, right?
Starting point is 00:27:31 And it's a three-tiered system, Alex, and that really helps us. ensure that abacus is always giving the right answer. So as you know, with LLM, so hallucination is a big problem, right? How do we make sure that the answers that an LLM gives is accurate and grounded in real facts and data? So our chain of validation has three steps. The first is triangulated retrieval, right?
Starting point is 00:27:53 So Abacus will look up, so say you ask a question like, how much is a leak, right? And Abacus gives the answer. A late fee is $25. She will look to cross-reference three different sources of information to validate that a late fee is actually $25. So she'll look at their online website, make sure that the fee listed on the website says
Starting point is 00:28:14 25. She will look at their truth and lending disclosure to make sure the disclosure says $25. And then she'll look at their internal policy documents to say that shows that the late fee is $25. And the she here is Abby the agent. That's right. That's right. Abby the agent.
Starting point is 00:28:29 And so if all three of those sources match, the data passes the first step in chain of validation, which is triangulated retrieval. Then there's a second step in chain of validation, which is claim decomposition engine, right? So we take the logical structure of a claim and we decompose it, right, and make sure that that's valid with the fact. So let's take our example, a late fee cost $25. We're talking about a thing, a late fee. We're talking about a fee, and then we're talking about amount, right? We take the thing, the fee, the amount is the logical claim that we're trying to validate, and we validate that claim against the information. So with an LLN, they give a lot of, you know, very colorful answers, right?
Starting point is 00:29:11 Answers that are meant to sound very human-like and quality. We're extracting just the logical claim that's being made. The thing is saying it costs this much. Is that true? You know, customers are allowed to sign into online banking and set up alerts. Are customers allowed to sign into online banking? And once they're in there, are they allowed to set up alerts? that's a logical statement, right, that we're trying to validate.
Starting point is 00:29:34 Okay, why wouldn't you do that before you did the triangulation of the $25 price point for this late fee that we're discussing? Because to me, it sounds like you would want to get to the absolute nuts and bolts of the logical progression and then check that against the facts. Yeah, it's a good question. Because if you check the logical progression of a statement, right, and it is valid, but the validity. So if I said, for example, all hats are dogs. That is a valid logical statement. It's not a true logical statement, but it's valid, right? All right, everyone knows that CRM isn't just software.
Starting point is 00:30:12 It's basically the heartbeat of your business. But it can get ugly quick if your data isn't organized and you're dealing with a messy tech stack. That's why I love HubSpot for startups. It's the all-in-one customer platform. So you don't need a frankincite of tools. No. Right now, early stage companies are going to get 75% off. And with this one system, you're going to automate marketing and actually converts,
Starting point is 00:30:38 track your sales pipeline without spreadsheet chaos. And you're going to manage your customers like the Amman Hotel, six stars all the way. You're going to get investor-ready analytics that tell your story perfectly. And man, when you pull up HubSpot and you get those metrics, you got those analytics, things are going to go really faster for you as a startup with potential investors. Plus, you're plugged into an amazing community of founders. who've already tackled what's ahead. They've been around those sharp turns
Starting point is 00:31:04 and they can tell you how to navigate them. HubSpot was built by scrappy founders. I know them. And they understand every dollar counts. That's why hundreds, thousands of startups trust HubSpot to scale their businesses. Here's an amazing call to action, so generous from my friends at HubSpot,
Starting point is 00:31:22 75% off. That's right. 75. Not 7% off, not 5% off. 75% off. HubSpot for startups. You're going to get three months of perplexity AI for free. That's a great pot sweetener.
Starting point is 00:31:35 Head to HubSpot.com slash startup. We're not just checking the validity of the logical statement. We're also checking the truthfulness of the statement. So the first is we want to make sure that the information going into that logical statement's true. Once we validate that it's true, then we validate the actual logical claim, right? Is the claim valid? So if they were, if something were to say a late fee is negative $100, that's not a logical claim, right? So that's not a valid logical claim.
Starting point is 00:32:00 So we know immediately something's wrong. So that's why we do it in that order. But then we have the third piece, which is what we call fact backpacking, right? So every single answer, Abigiz, comes with the actual document that provided the answer. We are in the document that answer came from in a full citation. So document ID, paragraph level hash.
Starting point is 00:32:18 So if that person using Abigus wants to see, okay, Abigus saying is $25, I just want to make sure that's right, you know, for my own sanity check, they can click to see the actual thing. physical document, okay, this is where it came from. This is the exact paragraph. So they think, oh, yeah, it says $25 there. I know the answer is right.
Starting point is 00:32:36 Is the person here that's checking this, the end user or the company itself who's vetting their own system? The end user. So we empower all of the end users, no matter if they're a teller or an executive vice president, if they receive any answer that Abacus gives and they want to make sure that that answer checks out. It sounds about right, but I don't know about that. Is it really 25?
Starting point is 00:32:56 I thought we changed it to $35 last month, right? they can click on the actual answer, right? And it'll show them, oh, yeah, in Policy 568, where it talks about late fees in this paragraph, I can see right here, Abby, highlighted exactly where it came from. Can I just say that if your credit union is charging a $35 late fee, get a new one. That's worse than Citibank, who I hate. Yeah, no, credit unions are really good about late fees. I'm just making up numbers here.
Starting point is 00:33:23 I know. I'm just really fantastic. Yeah, no, but I agree with you. Late fees can get completely out of course. control. I'm curious, with this three-part system, do you get hallucinations down to zero, down to a de minimis level, or down to a merely acceptable but still improving level? We have a 98% accuracy rate because of this system, right? Anything that the system flags, it kicks out, right? And because we train on financial data, you know, we know the kinds of
Starting point is 00:33:51 questions they're asking, you know, we can be very, very precise. So we have 98% accuracy rating. hallucinations are incredibly rare if they happen. We're able to get it down to a level so low that they're not material. And that's where the trust comes in. So once clients learn to trust Abby, you know, sky's the limit as to what you can do. That's amazing. I'm so happy that we've gone from, you know,
Starting point is 00:34:14 attention is all you need to chat GPT to this level of deep vertical integration in, you know, less than a decade. I mean, that's, that's why technology is awesome. Like, I mean, my God, I do hate that I feel so incredibly behind every single day because I feel like I'm sticking my head into a fire hose of news. But then you see where the rubber meets the road and it's pretty exciting. This is actually going to make smaller financial institutions more competitive, more efficient and just more viable. So that brings me joy. David, now, about your company, though, how fast are you growing?
Starting point is 00:34:45 We've grown tremendously fast. We have more interest than we're able to service right now. So we're really trying to, you know, move the needle on our fundraising round. and build up some infrastructure here so we can do that. Just it last year alone, we went from 25,000 in ARR to 125, or excuse me, MRI, MMR, 25,000 MRR to 125,000 MRR by the end of the year. So we have grown incredibly fast so far.
Starting point is 00:35:16 When I think about venture capital, I know this is this speaking startups, not this big in venture capital, but it often feels like the same thing. A lot of founders need to raise money because they need to, build, they need to reach a certain milestone, hire certain number of people, buy a certain technology or whatever. And so it makes a lot of sense to have risky capital in there. But going from 25K MRR to north of a million error in a year, I mean, it feels like you probably have so much more income to play with that you're probably not as capital constrained as your average Joe in the startup game. So why is it the right move for you guys to go out and
Starting point is 00:35:47 try to raise more money versus just self-funding? Because it feels like you have a fork in the road here, if you will. You're right. When I started the business, you know, I started the business and I wanted to run it like a real business. You know, we're cash flow positive. We don't have a burn rate, you know, because that's, you know, I'm from Chicago. I'm from the Midwest. You know, I'm not one of these fancy San Francisco boys. So I started it, the old-fashioned way, my business and, you know, worked at it slowly as we've made progress. To me, making money was the definition of, okay, now I have a business. We have net income. So you're right. We do have capital, we do have money coming in the door. So why venture capital? And the answer, Alex,
Starting point is 00:36:27 is, as much as I hate to say this, to win in this space, you have to be first, right? The company that these credit unions or banks or insurance companies adopt first is going to win, because once they're in there, they're very hesitant to change, right? Some of these banks unions have systems from the 80s, and they don't want to change it because it works. So the same thing is going to be true of whatever AI system they adopt. So we need to get there first. To get there first, We need a large infusion of cash so that we can move very, very quickly, make sure that we're capturing as much real estate as possible, you know, as the AI space is building out and companies are starting to adopt this technology.
Starting point is 00:37:05 And so that's why the venture capital coming in. So the gates open, you're running across the field to get the best spots at the concert, and you want to run even faster to beat all the nerds out, so you raise money. Okay, I'm here for that. But if you do end up raising some money going out there and then going back to profitability, Well, that would make my Chicago roots very happy to hear because I love net income. Those are my two favorite words in the English language. All right, David, before we go, one, what's the website?
Starting point is 00:37:29 And two, what is a role that you are hiring for? You're having a hard time landing the candidate. Yeah, so goadvocis.co is the website. And then a role that we're hiring right now, sales, sales, sales, sales, and we need people that have expertise in that industry, so banking, insurance, credit unions. So those are the roles we're really looking to fill right now. So if there's anyone out there listening that's interested, please let me know and go to the website and hit contact us.
Starting point is 00:37:55 But we're looking for people with that industry experience and that sort of sales point of view or disposition. So all my cool salespeople out there that know those industries, let me know if you're interested. All right. Well, David, thank you so much. And when you, I don't know, whatever your next milestone is, two millionaire are, whatever it is,
Starting point is 00:38:12 I'd love to have you back on to learn more about it because companies that grow as fast as yours are are on to something. So in the meantime, good luck. enjoy the Chicago summer. It'll be tank top season for a solid three months before it freezes again. We'll talk to you soon. Thank you, Alex. Thank you very much.
Starting point is 00:38:25 I told you that was going to be fun, but let's take it one step further. Let's go talk to Synthesia, not Sintesia. Come on, get it right, people. And let's learn more about AI generated avatars, training videos, video models, and all that good stuff. Let's go. We have been talking about AI so much on this show. I'm sure you want us to be quiet about it. Who wants to see more text generated by a robot?
Starting point is 00:38:47 Well, good news today. we are going to stay on AI, but we're going to move away from the written word and instead focus a little bit more on video. Now, regular viewers of the show will recall that some time ago, I heard tell of the neat new product called Synthesia, and I wanted to play with it, so I made a video, brought it to the show, showed it off to Jason.
Starting point is 00:39:05 Well, today we are bringing the company on the show to talk about what they're building, because I had added them to the Twist 500, partially because they raised a big ground earlier this year, but partially because they're growing very, very quickly, and we'll get to that. Please welcome to the show. It's Victor Ripperbelli, the CEO and co-founder of Synthesia.
Starting point is 00:39:23 Hey, man. Good to be here. Thank you for being here. For folks who are watching the video, it's actually quite late over in London where he is. He's been very generous with his time. But first of all, let's start right there. Building an AI company in the UK, I've been told Victor that if you're not building within three blocks of one part of San Francisco, you're doomed and yet you're absolutely
Starting point is 00:39:41 not. So what's it like to build an AI company over in Europe? There's pros and cons. I think if you weigh up everything at once, it's probably still better to build in S.F. For the U.S., but there's a lot of benefits, actually, for two building in Europe. One of them, I think, is actually talent, right? So the price of talent in the Bay Area is extremely high compared to the rest of the world. And it's not just about the cost.
Starting point is 00:40:06 Actually, I think there's also a loyalty element to this, which has two sides to it. In the Bay Area, most people are building their own personal stock portfolio, and they'll very quick to jump ship if you miss a couple of quarters, right? In Europe, there's a side of a different way you relate to your work. It's less transactional in sense. People care about the mission, the company they work in, and they don't care much about stock options, where it definitely has its downsides as well.
Starting point is 00:40:29 But my general sense is that people are less jump in Europe, and I think that doesn't cost beneficial if you're the founder of a company, right? You know, when we started the company back in 2017, the capital market was not nearly as global as they are today. I think that has kind of like equalized today, more or less. I don't think it's particularly harder to waste money in Europe than is in the U.S. You're less close to the ecosystem. I think that matters a lot if you're building, if your customers are other tech companies,
Starting point is 00:40:55 and that's one of the things that we aren't, does not be the thing for us. It's not a bit of top five industry for us. But obviously, if you're building, you know, developer tooling, you probably want to be in Bay Area that's just proximity to like all the tech companies who are ultimately your customers is much better. But I think building in Europe, building in the UK is getting better. and better and better and better. There's also the benefit of like, you know, a company like us,
Starting point is 00:41:17 who are one of the companies in Europe that do really well, if you're a great European engineer, there isn't that much choice, right? In SF, there's like a thousand cool startups who've laid lots of money and do very interesting things. So, I mean, pros and cons, but overall, I think Europe and the UK is actually become a pretty good place to build. I don't tell anybody, but I've actually scooted the UK back into the EU mentally.
Starting point is 00:41:38 I'm just preparing for a post-Brexit feature, so we'll see how long that takes. but I think we can kind of see the writing on the wall. Okay, Victor, what I did there was actually not start with what the hell you're building. So Synthesia, in my understanding, is a tool that I can use to create essentially AI avatars in videos. And these are aimed not at the consumer use case. I'm not making a 30 second clip of Gandalf walking around with Frodo, but instead I'm creating video materials for a corporate environment. So one, how close was that? And two, narrow it down for me.
Starting point is 00:42:10 Not bad, but I think a lot of people have this perception that we're an AI avatar company, and that's definitely the way we started, like, got it started, but that's kind of how we initially, you know, went to market, the hit product market fit for five years ago in 2020. I think by now we're much more than that. Advertars is a feature in our platform, but really what we are is an AI video platform for the enterprise. We cover the entire lifecycle of the video. We help you create the content, which starts with the AI models, the avatars, right, which replace the need for a camera.
Starting point is 00:42:38 But it's also a fully-fledged video editor, sort of like using Canva or PowerPoint. It's a modern collaborative platform that can support enterprise with thousands of people, creating things simultaneously, organizing themselves around that. It's a content management system for translation, updating, versioning. It's a publishing platform. We have our own AI video player that's built to show the videos you make into Thesia and deliver your insights and analytics back. So really, it's about the entire value chain.
Starting point is 00:43:03 It's not just about creating clips of Adetatts, even though that's where we started. you're right on the use cases. We're not targeting entertainment. We're not targeting advertisements, really. And the best way I'm thinking about these here and the market who serves today is people today want to watch and listen to content they don't want to read that much anymore, right?
Starting point is 00:43:20 In our private lives, everyone, I think, will agree with that. We're recording a podcast with a video right now. And most people prefer to consume information this way. And that's because when we're a free choice, that's what we do. But when we go to work, right? We don't have free choice most of the time. You have to basically consume a lot of text, a lot of emails, a lot of slides.
Starting point is 00:43:36 And that's just a much worse way of communicating to your customers, employees, your partners. That could be anything, product marketing, to customer support, to internal trainings. It's a much worse format if you're using text. If you're communicating with most efficiency, you need video. And what we've built is a platform where enterprises can essentially use our tooling to communicate in the most effective way, which is with video. But at the speed and scale of text, right? You don't have to use cameras. You don't have to use ad hoc.
Starting point is 00:44:02 It has to be great at video editing. it also comes out the box in a PowerPoint-style experience. And the markets we serve five years ago when we launched the first iteration of the platform, it was very much internal-focused, learning and development, work out training-oriented use cases. But we've seen is that as the advertiser and voice technology gets better and better,
Starting point is 00:44:21 and you unlock more and more tan. So today, it's 52% of all videos generated that are external-facing, which is like customer support. Oh, so over half now. It's over half now, right? And a lot of that is like product marketing, for example, but it's not like the fancy meta ad that attracts you initially. It's the mid-futnel content, right?
Starting point is 00:44:40 It's like you're interested in a product, you go to the website and you want to learn how does my product compete against a competitor. That can now be a video instead of a long page of text. Your consumer and you're trying to take out of mortgage used to be faced with a long page of text that very few consumers can sit down and comprehend. Now we can watch a five-minute video instead. So it's not like the top-level awareness stage of the funnel. It's much more in the kind of information sharing how to start.
Starting point is 00:45:04 of content that we found on mission. Now, when I showed the demo, we used a model of a woman sitting on a couch wearing a striped shirt, for example, that's kind of the thing you have on your front page. Can I use Synthesia to make a video or an AI digital twin of myself? Or am I always going to be working with a model that you guys have provided it? And also, I presume, tuned and tweaked to fit your voice algorithms. So you can make your own avatar, a very popular feature on the platform, one of those popular ones. It's super simple. You can use your webcam. It takes five minutes. You basically
Starting point is 00:45:37 just recite a script that we give you. You can also, if you want higher quality, you can go to a studio, you can record it with a camera. And essentially, like, the quality you input to the system is the quality that you get out. And I mean, these technologies are magical now, right? I mean, if you try cloning your voice at some part in time. Actually, I haven't yet because I think my voice is very annoying and I wouldn't want to listen to it more. So it actually hasn't occurred to me. But I will play with that. Yeah, because you can speak, you know, we can speak 30 languages at the click of a button. And it is pretty magical to listen to your own, like, actual tone of voice,
Starting point is 00:46:12 being in Spanish, German, Italian, whatever language you want to speak. That actually, that is freaking cool. And I presume because we're talking about multiple languages here, that the service itself does support much more than just English. All right, we support 140 different languages, which is also a key part of our value proposition, especially for global companies, right, who need to communicate across borders. all the time. So let's talk just a little bit about training data because you guys recently announced a deal
Starting point is 00:46:37 with shutter stock to ingest some of their visual information to help build out your Express 2 model. But when I'm thinking about how to nail an Irish accent, I mean, is there a repository? You can go out there and train against you to collect out yourself. How did you go about supporting that many different languages and speaking styles? So that particular part is separate from the deal with the shutter stock. If we start to kind of the video side of things, which is really where the stock is kind of relevant here,
Starting point is 00:47:08 basically what's happening in the world of AI, as we probably know, right, if the models get bigger and bigger and bigger and bigger, and they get more and more generalizable. LLN's kind of like the first iteration of this, where we built something which was essentially highly generalizable and you just do a lot of different things. It wasn't like a model that was like really good at doing just one thing.
Starting point is 00:47:26 In the world of video and avatars, if you look at what the product is capable of today, it's essentially people talking to the camera, and the quality has gotten really hard, eye, it's kind of edging on looking like a real video, essentially. But there's a lot of more things we want the avatars to. They want avatars that can laugh, avatars that can cry, avatars that really use their body language in the correct manner.
Starting point is 00:47:46 When you look at me speak right now, I use my hands, right? There's a beat to what I'm saying. The avatars are not there yet. They will be with these new flagship models. And essentially what you want is for our AI models that centers on human speaking, we want them to see lots of different scenarios, lots of different types of people interacting in many, many different ways. and that requires lots of training data.
Starting point is 00:48:05 So we both spend a lot of money on procuring our own datasets, working with actors all around the world and studios, 3D data and so on. But it's also really helpful with these models to see kind of a glimpse of the world from Shutterstock, which has a lot of content on that platform. So much.
Starting point is 00:48:22 Okay, so it sounds like this is very, very hard to do well. So you're using both data from other people, getting your own data from actors and voice and motion capture and so forth. how much better will express to the model, I believe you're currently training right now, be compared to what I can currently see via the Synthesia website?
Starting point is 00:48:42 It'll be a lot better. I think what we're getting to now is maybe it's helpful just to outline how I view the kind of avatar space in any video in general, right? So I think these technologies started five years ago, we were the first ones to launch the public and back when we launched the first iteration of them,
Starting point is 00:48:59 they're pretty crap, to be honest, right? The voices sounded like you're kind of talking to a GPS, and the video was kind of stilted, but it was kind of like good enough for some use cases. And we were some of the first to discover what those use cases were, and they were definitely not Super Bowl ads at all, right? But for internal training videos, where the alternative was asking people to read 10 pages of PDF documents,
Starting point is 00:49:20 these sort of crappy avatars actually was a much better experience, right? Now, what has happened since then is the quality has gotten better and better and better and better, but we're still in the realm of what I would call educational, how to, very kind of utilitarian practical content, right? It's about me delivering some information to you in the most engaging way. It's not yet about storytelling. Storytelling to me is making you feel something, making you laugh, making you sad, making you happy, right? That's what a real actor does.
Starting point is 00:49:45 That's what you look like a great app, right? It makes you feel something. That's what you see in the entertainment content. It makes you feel something. And I think with these new models, we'll actually begin to cross into that because what's going to happen is that the avatars can begin to perform their lines like a real actor. When I speak to you right now, my voice kind of slows down and speeds up, and I emphasize a specific word or use my hands to underline something.
Starting point is 00:50:07 That's how we speak as humans. That's what's most natural to us. And when we can teach these avatar to do that, we are going to be able to create content that's going to be even closer to the real thing. And it's going to have all the emotions and explicitly you would have from a real actor. So I think with these new models, I think we'll break through into storytelling. We'll begin to see the first iterations of creating content that's not just meant to, like, in form, but also kind of like entertain,
Starting point is 00:50:29 storytell, make you feel something. And I think that's going to be a pretty interesting watershed moment. We've seen that in other modalities, something like if you just stick with just the voice part. Essentially, before a couple of years ago, voice technology was only used for assistance in your phone and for GPS, right?
Starting point is 00:50:45 You would never listen to an audio book that was done with an AI voice. It was just way too bad. It was terrible. Now you have companies that basically, you know, take real books and turn them into all. your books and it's a great listening experience because all the emotion explicitly is conveyed. But we haven't had that moment for video yet, and I think that would be a pretty big watershed moment
Starting point is 00:51:04 that just unlock so many use cases. So I think it's pretty exciting. So what I just heard is that when the Express 2 model comes out from Synthesia, we're going to cross over to almost a new generation of AI video quality. It's going to unlock a lot of other use cases both in the corporate world, and I presume also at some point in time in the consumer domain as well. Absolutely. Yeah. That's incredible. So I'm going to go off topic here and talk about where I see this going and I'll bring us back to the corporate world and how you're doing and so forth. But I recently was playing with Open AI's voice mode, just chatting with my AI.
Starting point is 00:51:38 And I, we had a talk. I gave it a name. It was kind of weird. I almost felt like I was crossing some sort of like ethical boundary. We were just talking about like the stock market, but it still felt different in the interesting way, almost like I was talking to a person, even though I knew better. If I take that moment of like, oh, wow, this is actually really good and apply it to what you just told me about what you're going to pull off with the Express 2 model video, I mean, how long until I have like my AI is not only persistent in terms of memory and knowing me, but also has like a look to them that I would expect to persist across my digital environment. It feels like to me, you're building the real front end for AI here in a way that other people haven't yet and they're still trying to give.
Starting point is 00:52:22 me to type into a chat box. So to me, what you're describing feels like it opens up a Cambrian explosion of possibilities versus just making middle funnel corporate materials that much more beautiful. For sure. I mean, I think we're so early in all this technology. And as to your point, right? I mean, these things will be real time. There'll be, the fidelity will be 10 times what it is today.
Starting point is 00:52:44 And you will be, at some point, right, probably have like AIs that can tell joke if you laugh laugh at, but sounds weird. We may even at one point have you coming home after a long day of work and sitting down on your couch. And instead of turning on Netflix to entertain yourself, you may actually begin talking to an avatar go through some interactive kind of experience, right? It's kind of hard to imagine. Five years. Yeah, I mean, maybe even before that, but definitely in five, ten years, you know, I think this is going to be a completely normal part of living a life. And it'll be weird.
Starting point is 00:53:13 There'll be lots of challenges. But ultimately, I think what this promises, right, is that the interactions we have with the computer. is going to be much more natural the way our brains are wired process information, right? Like, they know, like using a keyboard and a mouse, it's natural to all of us today
Starting point is 00:53:28 because we've, you know, we kind of grew up with it. But everything really is a proxy for how we actually prefer to interact with each other in the real world, right? Which is we talk to each other, we see emotions,
Starting point is 00:53:37 we use our hands, we show people things. I think we can just, we can get much closer than. And it sounds like you had the same experience, but the first time you tried something like opening ass and voice more, right?
Starting point is 00:53:46 It is kind of like a holy shit kind of moment where like you almost cannot believe it's real. But like the first time you use chat GPT, right? It's truly magical technology. The flip side of that is also very interesting how quickly we just get really used to things, right?
Starting point is 00:54:01 Like with chat TVT, it's like if you took chat TVT back like 40 years in time, you'd get burned at the state, right? Oh, absolutely. Someone invented is like insane. But now everyone's like, it can't solve all by homework in one prompt, right? I have to do it in two prompts.
Starting point is 00:54:19 That sucks. We get very used to this very quickly. Yeah, but I mean, we get used to it quickly because our expectations rise so fast. So the first time that I saw a Synthesia video, I thought to myself, this is 90% of what I could want. How dare they not give me 100? Which is an insane perspective to have, given that, you know, not that long ago, this was, as you said earlier, poor voice, still did a video, kind of crappy, but definitely a great first step progress. But compared to today, I mean, it's mind-blowing how much better what things are. and I almost feel like I should be more enthused.
Starting point is 00:54:51 Like I should be more excited day to day, but instead tools become wrote very quickly when you actually go about using them. Exactly. And I think a great analogy here, right, is like visual effects in movies or computer games, right? If you go back and watch a movie you watched when you were 10 years old, you were like mind blown as the realism
Starting point is 00:55:11 and you're back then you're thinking like, this cannot get better. We've reached like the apex of like visual effects. Yeah. And then when you go back and watch it today, it looks like crap, right? So we just get used to this stuff so quickly. It's also interesting, especially when you work with humans, which we do, right?
Starting point is 00:55:25 Humans is the hardest thing to synthesize and make AI videos about, right? Because we're so sensitive to even like the smallest inconsistencies, right? Like, when you look at a human, we have so many like micro interactions, the micro movements that goes on all the time. And we're incredibly good at spotting if something is like just a little bit off, much better than if you're just making a video of like waves in the sea or something like that, where I think in a lot of AI models now are capable enough to do that. But it is a body, it's kind of a moving goalpost, right?
Starting point is 00:55:55 Which is kind of like very fascinating. Okay, let's bring it back to today. So actually, as this video comes out, we're recording this just a little bit early. You guys are announcing that you've reached 100 million in annual recurring revenue. One, congrats. And two, and this is the stuff that actually blew me away more, is that 70% of the Fortune 100 are now covered. and that's up from 40% about two years ago.
Starting point is 00:56:18 You're getting pretty darn close to having every single of the largest companies out there a customer for you. So I'm curious, what can you tell me about what they're using Synthesia for today? Has the use case changed at all? Is it where it was, even though you're going into the large companies in the world, or do they want something else out of you? It has definitely changed over the years, you know, and we're very fortunate to have worked with lots of our customers over many years and gotten into, you know,
Starting point is 00:56:41 larger and larger contracts with more and more people in those companies used and as I can't describe a bit earlier, right? It still revolves mainly around like practical content and utilitarian content. It's about informing someone of something. It's not storytelling ads and those kind of things yet, right? But what we have seen is that a lot of our customers that started working with us a couple of years ago, maybe they started with through like internal training and learning, but then since then, like now they're using us to customer support.
Starting point is 00:57:07 They're using us for product marketing. They're using us for, you know, partner integrations and a whole bunch of other things. So I'm curious about your costs because the technology is awesome. You guys did just raise $180 million series D. I think you said it was a $2.1 billion valuation. That's a big chunk of capital. So do you guys have very high compute costs? Is training very expensive?
Starting point is 00:57:29 Or is that money more earmarked for people, marketing and so forth? So it's always been very important to us to build a great business. And not just in terms of top line growth, but in terms of unique outcomes. a really, really solid business, kind of like a margin's perspective, retention metrics and those kind of things, which is something I'm really proud of, almost more proud than a whole non-AIR figure. Now, that's said. Of course, we do spend a lot of money on training AI models, right?
Starting point is 00:57:54 Like, it's important for us to stay the leader of the field, have the best models and so on and so forth. But I generally take a very practical approach to how we train models. I don't want to just train models for the sake of training models. There's a lot of companies out there that start with the technology, right? So let's train an AI video model that can do absolutely anything. If you want an AI video, I'd do absolutely anything, that's going to be a very big training one. It's going to be very expensive,
Starting point is 00:58:17 and your model is going to be kind of okay-ish, a lot of things, but not really, really good at one thing. And for some use cases, that's the thing you want, right? It's not because of, I'm saying that that's not great. What we're doing, though, is we're focusing purely on humans talking to the camera. So that's a much more narrow domain than all the videos in the world. That also means that we can be smart around like how much training we need to do. We can work with open source models.
Starting point is 00:58:40 We can add a lot of our own data, but of course, ton of old algorithms. So even though we do spend a lot of money on AI models, that's not the predominant reason for raising this capital, right? That is predominantly going into headcount and all the usual things you'd see in a SaaS company. But I think it's kind of interesting how the market profile of AI companies, right?
Starting point is 00:58:58 It's like they look very different depending on what kind of model you are. In general, it looks like it looks better if you're like workflow-driven and use case driven, as opposed to being in the model layer. Yes. Because when you build a workflow on the platform around, your model, right? You're not really charging for the model. You're charging for the work for you selling to our customers. And for us, right, what that means is that, to the point I made earlier, right, it's like, yes, the avatar is a feature in the products, but they are a part of the product,
Starting point is 00:59:24 right? What people pay us for is not just avatar videos. They pay us for translation, version, video editing, collaboration. They pay to use a video player and the analytics suite that comes with that. And I think that means you can build, well, the different shape of business, of course, like being a foundational model company and is amazing, like looking at open-air and probably all these guys, but they're going to be great companies,
Starting point is 00:59:45 but it's going to be very few of those. And I think if you're not going to be one of those truly big winner-takes-all kind of companies, I think you want to be very smart about what models you train and how much money you pour into that, right? Because if you pour in, if you're $200 million to train models and you still don't have the best model in the market,
Starting point is 01:00:01 you're basically kind of screwed. So we think a lot of, we think backwards from the use case and for the workflows, and when we need to train all models, we do our own models, do that. If we need to integrate other models into our ecosystem to create the best experience for our customers, we'll do that. We're not religious about having to do everything ourselves, right? And I think that's been a key part of our strategy for many years. Hey, if OpenAI,
Starting point is 01:00:22 Anthropic, Google, X-AI and Mistral all want to spend $100 billion making something awesome and then beat each other up on price and then offer it to me, cool. I'm in. I'm happy as a cat because I don't have to lose all that money and I get the best stuff. So I do really appreciate that. Now, we talked about the Fortune 100, clearly biggest companies out there in the world. I'm curious about the next like 5,000. So are you guys seeing, you know, smaller, more mid-market companies have a similar appetite for what Synthesia offers, essentially video AI technology? I mean, we have customers that are of all size, right, from just individuals all the way up to, you know, the world's absolute biggest companies. And we have a self-service product,
Starting point is 01:01:03 we have a freemium product, which I think you've interacted with yourself, right? So all that is, of course, also very important for us. But to my point earlier, around focus, we're building for the enterprise, you know, and that means we're building for businesses, and that means we make a bunch of tradeoffs that makes our product amazing for the enterprise and for bigger businesses.
Starting point is 01:01:19 And maybe sometimes means that we're not always like priors as it needs of like a small business. That said, you know, we work with lots of small businesses who love the product and we love working with them. And there's going to be, I mean, tools like Synthesia, I think the best way is thinking about the ultimate tam of something like what we're building right now
Starting point is 01:01:36 is basically PowerPoint, right? Every office worker in the world today creates PowerPoints. And in 10 years, time, they're going to be creating videos instead of interactive videos. They're not going to be creating PowerPoints. I think that's for sure. And so our tool is enterprise focus in terms of like the company strategy, but the product in itself can be used by literally anyone, right?
Starting point is 01:01:55 And of course, when you have a PLG-driven model like we have, where you can go in and file a product and sign by a credit card and go for it, you get amazing. It's just amazing. one in, right, and everyone who wants to play around with it, and music can use it. It'll be interesting to see if the split between enterprise companies using it for internal knowledge sharing and smaller companies using it for marketing external facing, you know, creations, if that holds as the models improve over time, or if it becomes much more blended. I presume it'll become much more the same across the size of customer.
Starting point is 01:02:27 But the question, I guess, then, is just how quickly. But I guess things are moving so fast in AI, maybe, you know, soon, I guess is my answer to that everything feels like it's about six to 18 months away. Well, I think there'll be different. It's going to be nuts. It's going to be nuts. But I think also, I think what you'll find is if you think of, like, the total market for this being any communication touchpoint,
Starting point is 01:02:53 if it takes the PowerPoint or video or whatever, as the market that we're targeted, right? I think the shape of the different types of markets will be different. So, for example, in internal facing communication, you'll have a lot of videos that'll all be very different because they're all talking about some different topic, right? Which makes sense, right? You're teaching your partner how to do things,
Starting point is 01:03:11 you teach your customers how we use the product. There's just so much communication that happens in an enterprise and with that kind of close stakeholders. Now, if you take advertisement by shelf, right, that's probably going to be different to shape. That's probably going to be more. I'm going to spend a lot of time on this one video and I'm going to make it really good.
Starting point is 01:03:28 And then I'm going to iterate like 10,000 variations of that video. I'll change out the avatar to be slightly older, slightly younger, I'll change like the hook, right? I'll change the way the product is positioned in there, because that's a conversion rate optimization game essentially, which is what you're doing on meta, adworks, et cetera. So I think that's going to be more about like you create like a couple of great base ads and then the system, you know, creates many, many, many different iteration of that to figure out which ones works the best. And this is actually exactly what has happened today in, especially in text, right? With AdWords today, you don't, when I, when I was a
Starting point is 01:04:02 teenager and I did AdWords, right? You would have to write all the different AdWords iterations yourself. What you do with AdWords today is you can't just give it a theme, right? It's like, this is my product. It does X, Y, C. This is my competitors, whatever. And then AdWords just automatically generates, I mean, thousands or hundreds of thousands of different variations to figure out which one performs the best, right?
Starting point is 01:04:22 Now, that's not possible in video today because we can't really programmably create video at that level of scale, but that is going to be possible very, very soon, right? So I think it's got very interesting to see the difference between this kind of algorithmic creation of video for conversion rate optimization. And on the other hand, we'll probably have more kind of PowerPoint style creation inside companies where you'll still want to have like an onboarding video for the sales team. You'll have like a specific video explaining how a partner should integrate with your platform and the pricing policy and all those different sorts of things. Which is still very early, but there'll be so many different shapes of products they'll, you know, come to rise in this ecosystem. This is mind-blowing because I just realized that eventually, you know, run the tape out far enough. Instead of having text ads that are targeted for my demographics, my age, my location, my education level, whatever, it's going to be like, here's the best video avatar to sell X to Alex.
Starting point is 01:05:17 And here's the best video avatar to sell Z to Victor. And I'm really curious what that is for me. Like, I don't know. Is it like an old man? Is it like a woman my age? Like, I have no idea. but someone's going to math that out and I'm going to get told a lot about
Starting point is 01:05:32 myself based on how I'm sold to and I don't know if I like that. That just feels, is that therapy or is that advertising? As I said, I mean, there's going to be, it's going to be weird and it's going to get real quick and lots of ethical considerations and culturally, right?
Starting point is 01:05:49 I think there's, the fact that these technologies are developing so quickly is very exciting, but humans are much, lower to change and to understand new technologies and what's happening in the world than technology is developing right now, right? And that definitely creates some problems or some potential challenges that we have to make sure we're kind of square up to.
Starting point is 01:06:13 All right. Well, Victor, an absolute treat. When Express 2 comes out, I want you to come back on the show and show it to me. And I'm curious to see how far things can go. But in the meantime, I'm going to go make your AI avatar tell me more heavy metal facts because I can. What is the URL for people to go check it out themselves? and quickly what's a role you're struggling to hire for.
Starting point is 01:06:31 Right. At www.suntisia.io, go in, create a free account, make an avatar of yourself. Right now, we're struggling to hire, and we're hiring across all roles, and we're building an amazing go-to-market team across North America and Europe, and we're building amazing engineering teams in Europe. So if you're in one of those two camps and you want to join a company that's become a $100,000 company one day, then you know where to go. All right, thanks, Victor.
Starting point is 01:06:59 I absolutely love talking to founders. It actually just never gets old. I've been doing this for, what, a decade and a half now? And every time you talk to a new founder, you leave pretty darn energized. We're back tomorrow with our live news team with Jason and Lon, but I do really enjoy getting the chance to sit down with founders, dig a little bit more deeply into what they're working on, and just riff, just learn what's working in the market, what's not,
Starting point is 01:07:22 where is the capital really flowing? So expect a lot more of these. I have a bunch in the can. We're recording a lot more this week, so there's a lot more twists coming your way. I'll see you tomorrow. Bye.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.