The Data Stack Show - Re-Air: The End of Busywork: How AI Transforms Productivity at Scale with Alberto Rizzoli of V7

Starting point is 00:00:00 Hey, everyone. Before we dive in, we wanted to take a moment to thank you for listening and being part of our community. Today, we're revisiting one of our most popular episodes in the archives, a conversation full of insights worth hearing again. We hope you enjoy it, and remember you can stay up to date with the latest content and subscribe to the show at datastack show. Hi, I'm Eric Dodds. And I'm John Wessel. Welcome to The Datastack Show. The Datastack Show is a podcast where we talk about the technical, business, and human challenges involved in data work. Join our casual conversations with innovators and data professionals to learn about new data technologies and how data teams are run at top companies. Before we dig into today's episode, we want to give a huge thanks to our presenting sponsor, RudderSack.

Starting point is 00:00:52 They give us the equipment and time to do this show week in, week out, and provide you the valuable content. RudderSack provides customer data infrastructure and is used by the world's most innovative companies to collect, transform, and deliver their event data wherever it's needed, all in RutterSack. real time. You can learn more at rudderstack.com. Welcome back to the Datastack show. We are here with Alberto Rizoli. And Alberto, I cannot wait to dig in on this episode because you've been working in AI for more than a decade, actually. And so I think you're going to bring a wealth of knowledge to our guests. But give us the quick one-minute background on yourself. In a minute, I'm Albert Ruzoli, co-founder and CEO of I have started my journey in 2015 in AI.

Starting point is 00:01:42 So shortly after the first Khrsevsky paper to demonstrate it that GPUs can train neural networks efficiently. And so I've seen all the iterations of AI hype and then AI dehypeization. And we're obviously in the most exciting time ever in the field. And I think we have enough of a technology to change the world in the literal sense. and the way we're doing it at V7 is by enabling people to automate the most boring parts of their work. Everything that involves document understanding, back office workflows that are slowing down enterprises massively and that usually involve dozens or hundreds of people doing work that they frankly would rather not do and consider to be repetitive and giving them the chance to develop systems to automate that on their run.

Starting point is 00:02:34 Alberto, it's going to be really fun to talk about. So I'm super excited to talk infrastructure with you. We talked a little bit before the show. There is a wide variety in infrastructure and AI today that people are selling under these same marketing campaigns, and it looks really different. So we're going to dig in a little bit into that topic. What's something you're excited about chatting?

Starting point is 00:02:56 I think I'm very fascinated about what the end stage should look like. So today, we're all dipping our toes into AI workflows with in our own businesses. some people are taking it to the extreme quite successfully, but what I'm interested in is like, what will the Fortune 1000 look like in five years if they correctly implement today's AI paradigm? And what will the perceived roles that are affected by it

Starting point is 00:03:19 need to do to adapt and successfully adapt to all these changes? And then, like, what are the things that AI is still not very good at doing that we probably should not bet our careers on? Awesome. Let's dig in because we have a ton to cover. Alberto, like I said in the intro, what a privilege to have someone who's been working really at the forefront of AI for over a decade. It sounds weird to say that because the industry is moving so quickly. But before we dig into your background and then, you know, talk all about data infrastructure and AI and what you're building of V7,

Starting point is 00:03:53 I have to ask more of a personal question. So you've been an entrepreneur since you were 19. You've been working in the forefront of AI. and that's so interesting because, you know, kind of what you've known in your professional career is being your own boss, essentially, and working with the latest and greatest technology. If you had to go get a job outside of tech, what would you do? Like, what interests you? Great question. I've asked myself that many times. Probably hard to answer after so many decades in the saddle as an entrepreneur. Yeah, indeed. It's effectively, if I hadn't chosen the entrepreneurial career,

Starting point is 00:04:37 I would still be building something. And the careers that I considered when I was in high school was either theoretical physics, which does not involve building something. And so I scrap that because you have to spend 15 years and then maybe you get mentioned in a good journal. And maybe you make a good discovery. And then you're done. Essentially a teaching career. Teaching and research. Yeah. Yeah, if you can't succeed, you end up having to teach in any academic field. But the other thing I was really interested in was user experience design, specifically in the advent of the personal computer revolution,

Starting point is 00:05:14 I think we had to figure out all these things that we take for granted that have to do with user experience all the way from like the mouse and the graphical user interface. These were brilliant inventions at the time because there was no pre-existing paradigm. And why I find AI so fond these days is that it's a repeat of that. We're finding out new ways to interact with machines that have this new paradigm of an LLM or like this token predictor. And I think we're maybe like 20% of the way there in really figuring out what in which ways we will interact with computers and intelligent machines in the more distant future. So in short, I would have probably been in the product side at a tech company if I had not to pick a founder crew.

Starting point is 00:05:55 I love it. I actually was reading recently just doing some research on the chat interface. and some of the inherent limitations there and other things like that from a user experience standpoint. And one author mentioned that he said something very similar to what you said, which is that LLMs present the biggest user experience shift that we've had in 60 years and is sort of a major paradigm where you're moving towards sort of an outcome-based, you know, you're telling the computer what outcome you want as opposed to issuing a command and then getting a response. And that being an exchange where there's direct input from, you know, a mouse.

Starting point is 00:06:30 or a keyboard or even buttons that you click or other configurations that you make, but you're actually giving an open-ended outcome and expressing your intent to the computer, which is fascinating. It's hard to believe that 60 years ago was the 60s. For a second, I thought, like, oh, but the GUI was a much bigger fit,

Starting point is 00:06:47 but actually it's around that time. Yeah, I think that's precisely it. We're no long... We also are now interacting with machines that adapt themselves. They're almost like shape-shifting software and it's able to adapt to the need or tool that we need to use at the time. And there's an avenue by which maybe the feature is software,

Starting point is 00:07:07 a software that codes itself and generates what the exact minimum viable code that we need to run to do something. And it's very similar to sort of a science fiction or fantasy magic tool that just turns into whatever we need at the time. Yeah. And yeah, that brings up a lot of different possibilities. and also a bit more of a winner-takes-all approach to the world of software, which is a more concerning aspect.

Starting point is 00:07:34 Yeah. Well, let's talk about science fiction a little bit because back in 2015, you were working on the cutting edge of computer vision technology and translating images into text across multiple languages. And you know what really struck me about that, and I hope this doesn't come across the wrong way. But when I was, you know, looking at that technology and prepping for the show, you know, in terms of, you know, the timeline, it wasn't that long ago. But relative to what we have today, it felt really primitive. I mean, it felt like a very early prototype of, you know, sort of the types of things we have today. Could you give some perspective on that? Because back then it was really cutting edge. I mean, there was, there were very few technologies that could do anything like that. You point your phone at an image, it speaks the word back to you across multiple languages. I mean, that was

Starting point is 00:08:32 really wild back then. But now with GPT's real-time vision stuff, you know, it does kind of feel primitive. And that happened very quickly. Yeah, it happened quite quickly, and most of that progress happened in the last three years. Yeah. So I Polly was an app effectively to cap for images with their smartphone in real time. So think of it as a video. And then every frame of the video was understood by AI and spoken back at you. So someone with the visual impairment could wave their phone around and hear what they had in front of them. And it had a vocabulary of about 5,000 words and objects. And if you were to try it, it would have a similar wow effect at the time, as if you were to use GPT or Gemini's real-time vision features today. But only if you were an engineer,

Starting point is 00:09:23 because you understood the impressive capabilities required to make something like that work. And the actual proof points is whether your average person on the street would gain a lot of value from this type of experience, which is why it was successful with a very niche group of users, which are people that have very low vision, and some enterprise use cases that knew exactly what types of objects they needed to understand, and therefore you could limit its capabilities. But it was a good proof point of why AI only succeeds when it's able to cover a lot of edge cases. A good way to think about AI is that it's always a distribution of data and the AI model models that distribution. Think of it as a bell curve where the total number of objects that you interact with on a daily basis is like the very middle of that bell curve.

Starting point is 00:10:13 And if you have a product that doesn't capture the edges of the bell curve, which are the least common objects you interact with, you consider it to be a bit useless. In particular, because as the average person, you don't actually care about identifying a glass of water. You care about identifying that one component in your AC, which is broken. And we've only been able to enable that at scale. Before that, you had to spend months of data labeling time and R&D to build a custom model to understand the inner parts of an AC unit,

Starting point is 00:10:47 which only made it viable to companies that had research budgets and we're really focused on a big problem. That has completely changed. And the main reason is that the AI models are able to vacuum up most of the Internet and then train on them. And we discovered that if we gave a lot of money to a few entities that spend billions of dollars to train big models, we end up creating a better outcome

Starting point is 00:11:13 than if a lot of small companies focus on their own domain and create bespoke models for that. It's unfortunately kind of a power law, distribution aspect of the rise of AI. Yeah. Can we dig into the, you mentioned that AI is a data distribution and then LLMs are processing that. Let's talk about the infrastructure and the data layer. I know John has a number of questions here, but I'll kick us off with, can you paint a picture of the differences between just the underlying infrastructure? And it's really funny that you said, you know, this would be

Starting point is 00:11:49 amazing, it would be a crazy experience, you know, but only for an engineer, for the average person, it's like, okay, I put my phone at a glass of water and it says this is a glass of water, that's cool, but an engineer, you know, he would be, you know, amazed by this. Which I actually think is a side note is kind of an interesting description of AI in its best form ever where, you know, it almost disappears and you have this sort of, you know, really fluid experience. But can you talk about the infrastructure then and what you and your team had to build? and the data processing layer versus what you're doing today at V7. How is this substrate of infrastructure changed?

Starting point is 00:12:28 Yeah, absolutely. When we first started V7, the majority of our workflows that we built were training workflows. So they were teaching an AI how to identify objects that general purpose AI did not know. And that aspect was essentially capturing thousands of images of unique objects. and then loading them onto E7, which would use AI to pre-label some of them. So it would segment relevant objects. It would understand what you're trying to identify there.

Starting point is 00:12:58 And then having humans complete the rest of the work, add additional tags, adjust the labels, and then a machine learning scientist would train a bespoke model that can understand these objects, say, again, the parts of an AC or the specific components in a document, like the shipping address and the consignee address and the unique reference number. And only then you could start using these models. And if you had to change anything in these models, you had to go back into the infrastructure,

Starting point is 00:13:28 add additional labels, restart the GPUs, and train a new model, and then push it back into production. Now that has become a lot more fluid. A user can go into B7, drop the document that they're looking to automate and simply explain in plain text what are the fields they want to extract. And if there's anything that AI is struggling with, you just explain in English a little bit more on how to find those specific fields or what rules they need to follow in order to reason through say, what is an acceptable termination clause and what is not an acceptable termination clause in a contract. This has changed the user base from being very technical and scientifically minded users to people that are actually doing the job that AI was being built for. And I think that's a massively

Starting point is 00:14:14 fundamental shift. And the stack or the infrastructure now has moved away from something that leads you to training a model to something that leads you to just showing an off-the-shelf model like GPD5 how to do a particular task that is unique to you or to your business. And effectively, we have moved the workflow from the end goal is to produce a model to the end goal is you start with a model and then given the inputs that you give it, whether there are documents or recording of this call, what do you want to get out of it? And then you're configuring this workflow

Starting point is 00:14:50 so that it can call external apps, send an email, or just understand within this unstructured data context, what do we need to extract, and what guidelines and rules do I need to follow to evaluate what I just extracted? This model is a lot closer to either standard operating procedures within a company or a back office,

Starting point is 00:15:11 and it enables these very back-off. is to effectively replace the most annoying tasks with AI workflows. The infrastructure has therefore moved away from the lab and into the hands of general users. And there's many different types of AI infrastructure as well. This is just the one that V7 specifically caters to. But you can think of it as like the next generation of e-commerce websites may no longer have a process by which they hire a photographer to go and take pictures of models wearing their particular dresses,

Starting point is 00:15:47 but they start by generating them, and then if a product becomes evergreen, then they go through the effort of doing a more refined job, which is getting a real human person to do that task. And whether you like this process or not from a fundamental ethics perspective of hiring a human to do the job, it is the most likely course of action that we will see. And so there's now AI infrastructure components

Starting point is 00:16:11 for almost every operation within a business, And what business leads into ask themselves is, what are the most likely ones that we can apply today? And that would ultimately be fully solvable with AI so that it's worth investing the time and effort into it today. Yeah. Yeah, I think that makes a ton of sense. And I do think that's going to be hard for people, right? In your example, if you, you know, somebody calls you up and said, hey, I know that we normally use you like to, you know, demo or try on our new product. or whatever, and then, like, we don't need you anymore.

Starting point is 00:16:46 We're replacing you with AI. I think that's going to happen to people. And I think, and we can, we've got, we're going to dig into that topic a little bit more later. But on the infrastructure side, I want to talk, we've got a number of engineer types that listen to the show. I'd really like to, like, paint this spectrum from people. So let's start with the marketing.

Starting point is 00:17:09 The marketing says, like, we have AI agents, we have agent swarms, we have, like, like whatever key buzzword you want to use. And it can do all these magical things. Like that's the marketing side of things. But then from an implementation side, from what I've seen and what I've talked to people, there's vastly different implementations. There are some very light-handed implementations

Starting point is 00:17:31 that are pretty simple workflow tools and you do a little custom prompting and you do like, you know, stitch them together. And, you know, there's that. There's some very more complex implementations. where you've got each component separate and you're doing, you know, vectorization and custom tuning of the models

Starting point is 00:17:49 or maybe you build your own model. It'd be fun to kind of talk through your thoughts on that. So when you talk to companies, like, I imagine a lot of these people, a lot of people that are kind of into AI, are aware of some of the consumer-grade tools, like an NADM, for example. Not that, like, you can use it.

Starting point is 00:18:08 You know, not that companies don't use it for some things. But I'd love to talk through that tooling world from like a end user that's kind of like getting into AI versus like true production AI like at scale for some of these like Fortune 500 type companies. And what's the difference and what should people think about? When should they know like, hey, I'm hitting a limitation of like my, you know, alpha project to my little, you know, project that I'm working on the side? Yeah, great question. So I think the simple stuff will eventually be eaten by the LM providers themselves.

Starting point is 00:18:41 Yeah. And simple stuff is your typical like Zapp from three years ago, which was full data from this app, applied some heuristics, push data into this app. And that is actually a lot of, a lot of tasks that are being done today. And just by nature, they exist because they're simple to configure. What's much harder to configure is build a system that helps us decide whether to buy a building or not, or like an asset manager. which normally is something that you give to an analyst that is paid quite a lot.

Starting point is 00:19:14 And they have all this inherent information about how to do the job, what the company needs, what are the way that evaluates a particular property, and ultimately like who to go to if there's a decision to be made. And this just expands the number of notes by quite a bit. And the infra ends up looking like you still have inputs, which are usually a document, reaching you by email, or a person just saying, evaluate this to an agent, and dropping in a PDF.

Starting point is 00:19:41 Then there's all these reasoning steps in the middle, which is probably the biggest value driver at the time, both for B7 and I think of AI in general. And then there's outputs. And a lot of startups tend to focus on either straight-up output generation or making it easy to, like Zap here, to take an input and then push out outputs. Because it's very minimal to consumers

Starting point is 00:20:05 and it's a low-hanging fruit for a lot of small companies, but in an enterprise, it's very unlikely that you have like an ABC workflow. You have a lot of steps in the middle before an output is generated. And that output can be an IC memo telling people, hey, we should buy this building. It can be a slide deck or it can just be a decision. Yes, no. Or like, let's update the CRM to say that this is a deal to be made. But what happens in the middle of the sandwich is really complex.

Starting point is 00:20:37 there's knowledge bases that need to be queried, there's chain of thoughts that need to be developed, and some of these are inherent to the process of the company, and some of them are crutches to just the way LLM's work and making them work more efficiently. The former is the equivalent of memory. You have a vector database. You pull information from it. That helps you understand how to solve a particular task or subtask. The second is about breaking any reasoning task into a smaller subtask.

Starting point is 00:21:10 And the reason why this is done is because LMs are largely trained on question answer pairs that are part of short conversations. It's unlikely that there's a lot of training data in these models that is of the nature of here's an offer memorandum for a building. Think like an investment manager and tell me whether you should buy it or not. And then it ends up producing a 200-presenter. page thesis document. Very unlikely. And so the best way to do it is to break it down into individual steps where every step is effectively a prompt and expected response with customized inputs.

Starting point is 00:21:47 The input usually starts with the core document. As you move along, you may use some of these previous outputs like, hey, let's evaluate the financial performance of this building from the past five years. Then you use that financial performance analysis as a downstream input to another property, in this case, not a building property, but a data property, to then at the end, come up with a final decision or output after all this like forced reasoning and forced filtering. And if you don't force it, you probably are not going to get a really good result that is up to spec to, you know, what a Fortune 1000 will do. The reason why we like these reasoning models so much is that they kind of do the reasoning flow

Starting point is 00:22:29 for us. And for every topic that they are smarter than us at or better than us at, it feels really good. But it's very frustrating when you are a lawyer, you ask a legal question to the model, and it starts making all of these assumptions that are not grounded in the way you actually do the job. And so a lot of this inference about actually grounding the model into, we are this business, and this is how we run the specific process. If you follow these steps, you will be successful. Okay, that makes a lot of sense. So if it's a simple, so let's think about, like, your original example, I'm using Zapier to, you know, currently to do this thing to, like, tag all my emails based on deterministic rules.

Starting point is 00:23:11 I want to use an LLM to make this a little bit better. I'm going to use Baby Zapier to better categorize my, you know, email. Very simple tech test. So I guess, I guess from there, would I imagine other than the technical, side. So we're going to kind of get into this AI implementation topic a little bit. I imagine that a lot of

Starting point is 00:23:34 the hard part here, there's some hard parts with like, you know, tuning models, getting context right. But a lot of the hard part here is still working with the humans to have SOPs of like what needs to get done anyways. If you've ever sat down with a team to like develop an SOP or even we were talking

Starting point is 00:23:52 before the show about like clear objectives, it's actually a fair amount of work just to like map out a clear objective and like the let's use okay ours for example like a clear objectives and key results like that's actually a lot of work to even do that. So I'm curious about strategy like maybe that you

Starting point is 00:24:08 guys have developed as you help people think through like what are we actually doing here in a system thinking where there's a bunch of people involved and it's not just one person's work for it's universally hated and the rules of construction are written in blood and

Starting point is 00:24:24 it's big to anyone in the construction field They're intelligent people who just want to do the thing the way they know how to do the thing. Yeah. And usually processes are only documented when something bad happens. And someone reacts by saying, okay, let's implement a way to do this properly every time. Somewhere, there's also an insurance company that it forces it. Yeah, so they can get covered. They still remain universally hated exercises.

Starting point is 00:24:52 And I think one way to hate it less is that, as you develop the SOP, which we see it just as a list of prompts effectively, you're actually seeing the work get done in real time. So one way we sort of solved it is that as you develop an agent in B7 Go, you're effectively adding all these rules and in English language, something that looks a lot like a standard operating procedure. And every time you add something, it recalculates your workflow and then you can see what the end results end up looking like.

Starting point is 00:25:24 So you're almost like simulating the job. over and over until you're happy with covering all the edge cases through the SOP. But the problem remains that almost no process, especially in enterprise, is known by one single person. And we're sort of like biased by usually being in startups or being in small businesses where you have that one person that kind of knows everything and knows how everything should be done. But companies are a bit more like ASML where, or large companies, where like There's no person that actually knows, sorry, at TSM, more so than ASML, but probably both of them. No person actually knows how developing chip actually works end to end.

Starting point is 00:26:04 It's sort of like this wisdom of many people that know their specific niche. And it's quite similar in, even though allegedly simpler in enterprises. And the difficult part is to get the right people to put the work into documenting how their specific slice of the pie works. And I think the only way to do it effectively is to give an incentive for people to just go into a platform and document this thing as a set of prompts. And I think the wrong way of doing it is to have effectively someone that goes and does all these user interviews like your typical for the float engineer until they build a full mental model of the system. Yes, it works, but it doesn't work organically. And I don't have a full answer on what the best organic model for it is. but I think there is a future by which

Starting point is 00:26:52 any company's biggest asset is its knowledge base and it is the aggregation of what AI has been able to observe across any of our work streams and document effectively their own opinion on how processes are run and that can be used as a knowledge base for running any other process. Hypothetically, if you had an AI that was monitoring your laptop 24-7, I don't think that is a viable solution, but hypothetically, he would be able to keep writing to a document to say, this is how John does business.

Starting point is 00:27:26 And then if it does it for John, for Eric's, for Brooks, for everyone, then over time, it starts to develop a pretty good thesis of how processes are run. Today, you kind of have to force that process it. You kind of have to get the humans to go and do that documentation. Yeah. Yeah. Well, that's exactly where I was kind of going with this is, you're right. Like people, most people really hate making stopping slowing down, making SOAPs, making a documentation. Most people really hate it.

Starting point is 00:27:55 And my question was going to be along the lines like, well, you know, how do we use AI to do that? And to your point, like, maybe there is a future state where you can essentially have, you know, the AI, you know, recording everything, right? But in the mean, but that's probably not practical yet. So in the meantime, I saw an article this weekend, and this is what made me think about this topic, where there's some of these robotics companies targeting smaller manufacturers, where you're able to essentially record with your hands, like, what you'd like the robot to do, and then the robot learns from that versus having to program the robot with code. What do we have that exists today that you think is useful with that same maybe mechanism,

Starting point is 00:28:43 with an LLM, other than like a prompt, obviously. But yeah, are there some like similar things that would feel a little more natural to people, you think? Something that we tried is to use screen recordings as training data effectively to turn it into prompts. And it works surprisingly well, but it doesn't cover every edge case. But it is a good way to, like we do have some customers that come to us, say we don't have a documented process,

Starting point is 00:29:07 but we have a lot of training videos on how to do this process. So we put the training videos through an LLM, then through that we generate effectively adjacent that can be imported into V7 to turn to create a workflow. And that worked surprisingly well. And maybe like the closest proxy to that in the research side is that computer use within NLMs is largely being taught as this like set of steps that a human is doing, like, you know, order a burger on DoorDash depending on where my address is.

Starting point is 00:29:38 And here's like a list of my preferences. and a human would go and try to fulfill that specific past and create training data for it. I think it's a good way to do that, but it's still, I don't know, I think it still doesn't solve the problem of like not everyone knows how specific process is run. But one thing that we do at D7 internally a lot is we use Lume

Starting point is 00:30:04 to teach a man to fish for almost any process. So if someone asks like, hey, how do I write, register time off or request a budget to do this particular event, then we just send them a loom on how to do that. And that very loom could be used to just teach a model that can then straight up answer or do the task. Yeah. Yeah, that makes sense. Yeah, it's super interesting because the, I agree that we're in the early stages of is there a really, you know, sort of elegant, you know, one-click way to do this. you know, that's still to be seen. However, as I think about the types of data, or let me be more specific, the data formats in which those processes exist or in which they can be extracted, right?

Starting point is 00:30:52 So we have a loom video, we have, you know, Notion or Google Docs, you know, that someone creates to explain the process step by step. You could even, you know, have someone explain it and spoken word. and all of that's, you know, sort of, you know, longer form unstructured data in various different formats, which happens to be like, you know, a marvelous input for a large language model. And so I'm pretty excited because I think that's going to rapidly become easier and easier because the formats are not a problem, whereas before that was a major, you know, super expensive technical problem. And that's not the case anymore, right?

Starting point is 00:31:31 And so I'm pretty excited about what we'll see happen in the next couple of years. Yeah, 100%. It's like the field of meta-learning and research is like learning how to learn might be the next big hill to climb for any of these models. We're going to take a quick break from the episode to talk about our sponsor, Rudderstack. Now, I could say a bunch of nice things as if I found a fancy new tool. But John has been implementing Rudderstack for over half a decade. John, you work with customer event data every day,

Starting point is 00:32:00 and you know how hard it can be to make sure that data is clean and then to stream it everywhere it needs to go. Yeah, Eric, as you know, customer data can get messy. And if you've ever seen a tag manager, you know how messy it can get. So Rudderstack has really been one of my team's secret weapons. We can collect and standardize data from anywhere, web, mobile, even server side, and then send it to our downstream tools. Now, rumor has it that you have implemented the longest running production instance

Starting point is 00:32:29 of Rudder Stack at, six years and going. Yes, I can confirm that. And one of the reasons we picked Rudder Stack was that it does not store the data and we can live stream data to our downstream tools. One of the things about the implementation that has been so common over all the years and with so many rudder stack customers is that it wasn't a wholesale replacement of your stack. It fit right into your existing tool set. Yeah, and even with technical tools, Eric, things like Kafka or PubSub, but you don't have to have all that complicated customer data infrastructure. Well, if you need to stream clean customer data to your entire stack,

Starting point is 00:33:06 including your data infrastructure tools, head over to rudderstack.com to learn more. Alberto, let's take a little bit deeper into V7 in this context. So you've brought up a couple of use cases, but I'd love for you to talk about what are the examples of where V7 is creating just immense, you know, sort of asymmetric leverage. for a business. And I know you work in multiple verticals. I'd love a couple of those examples. And then follow that up with, you know, here's why we do not, here's what we're not building

Starting point is 00:33:39 because we don't think that V7 could be really great and create asymmetrical leverage, you know, for these other use cases. Excellent. Yeah. So we work in several industries and maybe in order of size of business, it's financial services, insurance, real estate, tax and legal, And then a long tail from there that includes logistics. And what all of these industries have in common is that they handle a lot of paperwork. They have a general idea of how the paperwork should be handled. And very few people at these businesses enjoy that specific task. And the use cases themselves range from high complexity, high risk use cases that we can handle very well.

Starting point is 00:34:26 For example, in finance, it's a confidential. information memorandum, which is basically a pitch deck for acquiring the business, requires five to seven hours to process from an analyst's perspective to turn it into an IC memo or like something that gets getting to the investment committee. And an agent built through V7 takes about 15 minutes of processing time, including some review time by the human. So that's up 50 next improvement on the actual time dedicated to it, all the way to much higher volume use cases like understanding tax claims, understanding logistics slips, or even insurance

Starting point is 00:35:03 slips, which usually take, you know, five to ten minutes to process manually, but they are huge in volume. And they're usually a big cost carrier to the actual service that the company is fulfilling. We, like, I'm calling from the UK and the NHS spends about 40% of its budget in administrative costs. That's our healthcare system. All these administrative tasks, by and large, can be taken care of by AI to reduce that time frame quite significantly. So there's a big cost saving in general, and we're doing things rather inefficiently today when you imagine that that AI is possible. But on the other side, I think one of the biggest points of our OI is not easily measurable, and it is the lead time between an intent and getting a response back. So good to the

Starting point is 00:35:56 example of that is that we've all been in a pinch and needing something to be reviewed by an expert. An NDA could be an easy example that many can relate to. If you're doing a deal, 48 hours is a huge time frame. And like 24 hour turnovers for NDA review is already pretty darn fast. But if you start to consider almost every process in a company to now be a five-minute turn around time, then the full aspect of work and productivity changes for anyone who perceives this lagging indicator. And if you imagine a business where almost every process requires two days to get back to you, any IT requests, and that's fairly common outside of a startup where you can tap someone on the shoulder, versus a competitor who has this system that is just as reliable,

Starting point is 00:36:49 but it only takes five minutes and that you get your response back. that becomes a massive quality of life improvement to work that I think will encourage people to it's not really necessarily about doing a lot more of the same thing, but being able to do more things because there is a lot of room for interpersonal tasks and creative tasks. And maybe to answer what do we not do particularly well are just those that I've described. we don't want to get into the way of human creativity and human relations. I think fundamentally, I would love for a very smart 11 labs agent to answer my customer support queries, but that's generally because I don't need to develop a relationship with

Starting point is 00:37:33 customer support. But we don't want to get in the way of humans developing relationships with their own customers or with their own people or getting in the way of what is truly creative work. Whilst AI is amazing at producing creative outputs, from art to images to poetry, it is that good because it's read a lot of it. And in a way, it stunts our true creativity because it's not that good at creating something completely net new. And I think a lot of creative aspects is about wowing the world

Starting point is 00:38:04 by creating something entirely net new. Even if it's like designing a new way of doing something, designing a new process, it can be a useful co-pilot to validate your biases and assumptions. But I think that is the work of the future. is designing we're close in the most efficient and fun way possible, and the execution of them will be what we use machines for. Yeah, that makes total sense. And to drill in a little bit on something you said, the difficulty in measuring the ROI of, you know, I mean, okay, if you just look at

Starting point is 00:38:39 the raw percentage of, I go from, you know, a two-day review on an NDA or a contract with a vendor, which anyone who's been in a large organization and has had to go through that, you realize that there are, you know, there's this sort of class of people where they're really good at their job, you know, in terms of just the raw skills that were on the job description,

Starting point is 00:39:00 but they're very effective at their job because they know how to navigate the system more with more agility than other people, right? Oh, hey, I know this person down in legal and I did a favor for the system. them or kids play soccer together or, you know, whatever it is, right? I know their schedule really well, you know, or I've done this 100 times and so I know the process and so they're just able to get stuff done more quickly, right? But that's a horrible use of their time. That's a horrible use of

Starting point is 00:39:29 their like energy is trying to navigate a really complex system to do something, you know, that could be done in five minutes. But I think the reason that's hard to measure is because of the compound effect, right? Like in an isolated one time thing, it's like, okay, well, that's great. Like my week at work was better. But if you multiply that happening a thousand times a week across the entire organization, the time you get back, the energy you get back is it really compounds, right? That's really difficult to measure. But I think to your point, like, well, great, if I don't have to spend, you know, eight out of the next 16 working hours over the next two days, I just slot that off for navigating the system. It's like, well, I have that time back,

Starting point is 00:40:12 right? It's like, well, that's really interesting. 100%. And I think if you think about a sales career, the best salespeople in the world embody that, they find other revenues to remove friction

Starting point is 00:40:25 from their ability to close a deal that are not the standard operating procedure. They meet with their clients. They remember personal details about them. They do extra work for them that goes outside of what one would normally do. And then they eventually become very successful. And the parts they,

Starting point is 00:40:44 they don't want to do, though, are, like, I need to wait two days to get this deal approved or this statement of work approved because, and like the way they go around it is to, I don't, charm their way through it, to go and bother people, to go and push for things. But these are all things that AI is not very good at doing. Yeah, yeah, yeah. Yeah. Well, we're close to time here, but one thing I want to get your thoughts on, especially is you're implementing V7 in these companies. You're removing these barriers. Who is most impacted and what types of roles are at risk? I mean, you know, let's talk about the salesperson that you just mentioned.

Starting point is 00:41:25 You know, they're really good at navigating the system. They're really good at, you know, figuring out how to, I don't want to say cut corners, but they know the process well enough and they've built up enough institutional knowledge of how things operate in the organization to where they can just get things done more quickly than other people, you know. and unfortunately the cost of that is whatever 40, 50% of their job is navigating, not actually selling or building relationships. For that person, this is going to be, you know, sort of like a miracle drug of productivity, but they were already super productive, right?

Starting point is 00:42:01 Do you see that being the case where you sort of have this subset, let's say, you know, half of a team, you know, overindexes for being productive already? Are they the primary beneficiaries of this? you know, who's going to get the most asymmetric value? And I guess I say who's going to, who is, you know, and how does this play out practically on a team, you know, at a company that V7's implemented at? Great question. I think there's effectively three groups of people in the world that follow any process. They're the ones that have a different way of seeing the world. And it is very hard to replicate it. And they're almost like an artist and an operator. And that would be. be the example of a salesperson that figures out that their biggest client goes to

Starting point is 00:42:48 specific golf court so they learn how to play golf to get there. It's a bit of a crazy idea of that they're thinking out of the box. Then there's like efficient or maybe sometimes less efficient operators that are following the rules and they're doing quote well. And then there's folks that understand the actual operation at first principles to the point where they could teach it. And the ones number one and three are the most secure. Number two is the secure because they are, no matter how efficiently they're following the process, which could be developing leads and then doing sales, if they're not having something super new to the process each time and really think it outside the box, they're a risk of automation.

Starting point is 00:43:27 The ones that really understand the process of the first principles have a good opportunity of actually developing those automations. Software is becoming very easy to use, and so they could be the one that can actually automate group number two. Group number one, the creatives will likely not be automated for a very long time, if not ever. And a good proxy to this is manufacturing. We had only artisans in the past. Today we have people that design manufacturing processes. They help both factories.

Starting point is 00:43:59 Sometimes they're engineers, but sometimes not. We have super-skilled creative artisans that will create something that a machine could not make. And they just have this very unique way of seeing the world. and that we no longer have artisans that do something similar to what an automated process can do, at least in modern countries. And so, yeah, I think effectively, like you're either a crazy person that just does things completely out of the box, or you should start to think of how to turn what you're doing on a regular basis into a machine. Yep. I love that. One thing that we've talked about before, John and I've actually talked about this and sort of explaining, you know, different types of people within an organization.

Starting point is 00:44:40 and kind of how to navigate things. But this is a dramatic oversimplification. But, you know, one way you can look at people within an organization is people who create information and then people who move that information around, right? And if all you do is move other people's information around, I agree. I mean, you should be worried, right? If you can't teach, like you said, they have an understanding in the process at the point where they can teach it, but that's actually creating, you know, new valuable information.

Starting point is 00:45:06 So that's, yeah, super interesting. And I've looked at it. I mean, very similar framing, but essentially two jobs and they definitely overlap. One is that like creativity, creative endeavor job and one is that like continuous improvement job. And you can do both for sure, but those are those feel like the, and those are not specific to like a domain. Like sure, you'll be applying that and like marketing or operations or whatever. But I feel like that's what I've told a lot of people like if like continuous improvement

Starting point is 00:45:36 And that's like, you, if you've got AI that can do this, then like, how can AI do it better? And then for their creativity, like that has a million different angles you can take it. But other than that, like, I think there's a lot of change. A lot of change coming. A lot of change you got me. Well, Alberto, we're at the buzzers I'd like to say. But one more question for you, what is like an app or a product that you've used recently that's maybe new to you that really caught your attention? I'm just always interested for people who work in this stuff every day, who are building it at a foundational

Starting point is 00:46:06 level. And, you know, especially with your interest in user experience, is there something you've used recently, even a physical product or even an experience you've had where it just really caught your attention? I have, I think like many people have started to use video models more. And it's sort of, as someone that's been in the research side of AI for some time, it kind of turns into a lightbulb moment where you realize that a next frame prediction engine can do a lot more than just create videos of the foot, but it can actually create full demo experiences down the road. And that could be a nice little game changer that are also interactive. On the interactive side, I forgot what DeepMinds name for those virtual worlds, but there's many startups that do it. One, for example, is Odyssey Systems,

Starting point is 00:46:52 and it's effectively a navigable 3D world that is entirely aged out of it on the fly. And you can think of it as like a video game that continuously renders new things on the spot. And those will be very useful for training certain models that need to be embodied, like self-driving cars and embodied. Sure. But I think they will also have a tremendous impact on entertainment, on VR. And their generally is quite fun because they're exactly like the human mind. We're imagining a world in our heads. And then we're navigating through it. And we use that prefrontal low experience to think about whether our decisions are going to be good or bad. It's actually what differentiates us from a lot of animals that we picture scenarios in our head and we imagine like if I do this is going to happen,

Starting point is 00:47:40 if I do that is going to happen. And we can do it for quite a long period. And so it might even give AI a much better way of reasoning that is not like just spitting tokens out and talking to itself. And yeah, I would say probably these are the two most exciting that are completely outside of our domain. Yeah. We don't really deal with the generation of image assets. Another thing that we are experimented with a lot that I'm having fun with. When you look at B7, it looks a lot like a spreadsheet, but it's actually a database. It's a big table.

Starting point is 00:48:12 And every row is an agent run and various steps that it's taken and the information that is recent. Awesome. Yeah. We're experimenting. What would that look like if it was actually a through spreadsheet where you're not bound to a peer grid where you have like columns and rows, but rather something that's a bit more fluid.

Starting point is 00:48:30 And I think it's the perfect scratch path. for AI to operate in, but it's a thesis that we have not yet fully validated. But I think, like, of all documents that are going to change quite significantly with AI, I think spreadsheets are going to be the one with the biggest impact, because you're no longer doing regular expression. You're doing AI functions and app tool calls and running agents within a cell so it comes back with a particular piece of information. So there's a lot of U.S. changes that are going to occur there. Yeah. Yeah, that's super exciting. I heard a product leader, one time say, you know, you could essentially boil all software down to like forms on a database.

Starting point is 00:49:08 You know, that's basically what software is, right? Yeah, so sure. Which, you know, you're like, whoa, that's overly simplistic. And then you think about it and you're like, yeah, that's actually true, right? But I think what's interesting about what you're saying is that, well, and the point the guy made was like, so it seems like you could, you know, like, well, if Salesforce is just forms on database, like someone's going to build another Salesforce is like, no, the business logic is where they create the lock-in, right? And so I think what's, really fascinating about V7 is you're abstracting the business logic. You're abstracting that out and turning that into an AI layer, you know, on top of the database. And so it's going to be

Starting point is 00:49:44 really interesting to see what happens the next couple of years. Yeah, exactly. That's one way to explain it to an engineer. We are a multi-d database system like Salesforce that is built for AI from the ground up. And as we built this thing, we started to realize that there's maybe a chapter beyond this framework because of AI's capabilities to break out of the mold of an SQL table. Yep, I love it. All right.

Starting point is 00:50:09 Well, Alberta, thank you so much for the time it flew by, and we'd love to have you back on soon to hear about what's new with V7. Thank you. Thanks for having me. The Datastack show is brought to you by Rudderstack. Learn more at rudderstack.com.

The Data Stack Show - Re-Air: The End of Busywork: How AI Transforms Productivity at Scale with Alberto Rizzoli of V7

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.