The Data Stack Show - The PRQL: Can Machine Learning Be Commoditized?

Episode Date: August 19, 2022

In this bonus episode, Eric and Kostas preview their upcoming live stream episode featuring Willem Pienaar of Tecton and Tristan Zajonc of Continual. ...

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome to the Data Stack Show prequel. We just got done doing one of our favorite things, which is a Data Stack Show live stream. And the topic was ML. And we talked with two brilliant minds, Tristan from Continual AI and Willem from Tekton. And one of the most interesting parts of the chat to meet Costas was the discussion around
Starting point is 00:00:30 what components or even use cases for ML are essentially commoditized or known quantities that are extremely common and have a huge number of patterns around them. So things like churn, you know, any, you know, things like churn recommendations, et cetera. And the discussion around, okay, like these are fairly known quantities where you're changing the inputs and maybe varying the model a little bit, depending on the business use case. I believe Tristan used GitHub's AI code assistant as an example, right? Where it was a huge bet, AI-driven feature, and had a huge payoff and was net new, right? It wasn't optimization.
Starting point is 00:01:37 It was actually adding value. So that was such a fascinating discussion to me in terms of almost a bifurcation of ML into those two categories. What do you think? Yeah, I agree with you. I think, I mean, what I found like super interesting is that I think people, when we're talking about ML, they're still, the first thing that they recall is like, you know, like self-driving cars and I
Starting point is 00:02:06 don't know, AI generated art and like all that stuff, but at the end, this is like, just like, let's say the state of the state of the art right now. And it's probably like, just like the tip of the iceberg. I think the real value in the markets and the economy out there comes from AI that is applied on very structured data and to problems like prediction and optimization in logistics and boring stuff. And what I found super interesting about that is what they said about why these solutions, yeah, they're getting like, let's say the infrastructure might get commoditized, but at the end, its company wants to build, let's say, their own solution
Starting point is 00:03:01 or compete on the solution because it is like a competitive IP that they can. Yeah, that was fascinating. Right. So, yeah, you don't want to have the same trend prediction with all your competitors because at the end, you don't get any heads up. Yeah, that's right. Totally. That was a very interesting part of it. I think it resonates a lot with what like some investors at least say that like it's really difficult.
Starting point is 00:03:29 Like it's like doesn't make sense to create, let's say, products or modes or like try to do that like over data and these kind of like solutions because at the end they end up being services because they need a lot of information for its customer. Which is, yeah, I think both of my experience so far from what I've seen, it's true. Totally agree. And you're definitely
Starting point is 00:03:55 going to want to catch this one if you are working anywhere around ML. We talked about the state of MLOps infrastructure and tons of difficulties there. We talked about the gap between ML Ops infrastructure and tons of difficulties there. We talked about the gap between running ML models to drive insights and actually delivering outputs of models in real time, which is a huge technical gap. And then we also talked about the importance of open source in the ML space. So tons of fascinating stuff from two really smart people.
Starting point is 00:04:28 Subscribe if you haven't. Tell a friend. Share it with your friendly data scientist or data science team. And we'll catch you in the next one.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.