The Data Stack Show - Shop Talk: Accountability and Opportunity for AI

Episode Date: June 23, 2023

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building a...nd maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome to the Data Sack show, Shop.Costas. We have talked with people who built amazing data technology at companies like Netflix, Uber, and LinkedIn. But you and I actually don't record our talks about data very much. But we actually talk about data together a ton. And so Brooks had this amazing idea of just recording some of the conversations that you and I have before and after the show about data and our opinions on it. And really, this has been one of my favorite things that we do. So welcome to Shop Talk. It is where Costas and I share opinions and thoughts on a personal level about what we're seeing in the data space. And it really is simple.
Starting point is 00:00:48 We ask one another a question and the other one tries to answer it. So without further ado, here is Shop Talk. Welcome to the Data Stack Show Shop Talk where Costas and I talk shop. Costas, it's been a while since we've done one of these. Lots has happened. I have a new baby in my household, so that's kept me away from the microphone for a little bit. But this is exciting.
Starting point is 00:01:16 Oh, yeah. Yeah, I miss pre-styling with you, so let's do it again. One of the interesting things about the primary interface and ergonomics of chat gpt is that it's one i think it's amazing because it's just so familiar right uh it just such it's actually almost you know 10 years ago it's like you know, 10 years ago, it's like, you know, the most popular, like the most widely adopted, like,
Starting point is 00:01:56 LLM is going to be like a text exchange. You know, I mean, it almost sounds crazy when you step back and think about it. But brilliant from an interface standpoint. One of the things that's really interesting about presenting a bunch of options as search results that have some sort of ranking that's at least nominally transparent
Starting point is 00:02:15 to the end user who executed the search is that you're presenting some level of choice. One thing that's very interesting about the chat interface is that when you put in a specific query, the results are largely singular, right? And so you're actually not presented with a lot of choice. And that really, in some ways, to return to the accountability question, what's really interesting about presenting choice is that it the there's more
Starting point is 00:02:49 accountability for the user because they have to decide right like i'm offering you a variety of things and i'm applying some weight that may or may not be transparent to you but you ultimately have to decide what you believe to be sort of the best match. But it's very linear and you're getting a pretty singular response. How do you think that causes, I mean, I think it caused, that's one of the roots of the issues of accountability, right? Is that it's almost a binary choice. Like, do I trust this or do i not and i don't have there's no surrounding context that gives me a tool set with which to make that decision yeah yeah 100 and also i think it's even like worse than that because even like the same question if you ask like such dpt like it might might generate a different response.
Starting point is 00:03:48 It's not as deterministic, let's say, as it is when you search on Google. In Google, in a keyword, you're going to get the same results all the time. With the chat GPT, that's not the case. And I think it's one thing, and that's where things are getting quite interesting when it comes to integrating it with products and trying to productize it.
Starting point is 00:04:18 Right? Because how do you account for that, especially in use cases where critical decisions are made? Like, let's say I'm a doctor and I'm using chatGT to summarize stuff, how can we build a user experience product that it will help the doctor to do the summarization, but at the same time will safeguard both the model and the doctor from interpreting the wrong information or you know like making mistakes that it's like hallucinating like interpreting like hallucination like for example let's say just like what the like pretty much like everyone i guess like have heard about like
Starting point is 00:05:21 a large language models so i think that's like the interesting part. And like, that's why I'm very excited. And I think like the people who should be like extremely excited about AI and especially LLMs are product managers. I think there's like a very new modality that is given to them, a very new tool, which is almost like completely uncharted territory right now. And I think there's a lot of space for creativity in product design and management to make some
Starting point is 00:05:56 very interesting and create new types of experience there. Yeah, I totally agree. So I almost, this is distilling it down to like two levels, which is not actually reality. But reality is a construct, so it doesn't matter and I'm going to roll with it. Even the Matrix, right?
Starting point is 00:06:17 Yes. I think that there will be, actually I re-watched, that was on in the hospital when the baby was born. And I forgot how good of a movie that was. It was, I remember watching it for the first time. Yeah. The first one.
Starting point is 00:06:31 Yeah. Like, yeah. Unbelievable. Yeah. But so I, one thing that is a little bit disappointing to me about the hype cycle is that you mentioned it earlier.
Starting point is 00:06:45 This is going to drastically accelerate the amount of human creativity that can be put into jobs by doing things like pulling information out of PDFs and summarizing it, right? I mean, you mentioned the medical industry, right? I mean, there are entire like sections of publicly traded companies that have armies of people who go through PDFs and manually like copy information from a PDF into like some other system, right? I mean, that is wild to think about, right? And it's like, okay, those are creative human beings. If we can remove that from them and allow them to attack different problems, that to me, the first wave of the types of things that this will do will actually be pretty utilitarian like that,
Starting point is 00:07:41 and they will just unlock a bunch of bandwidth, which to me is extremely exciting. I know there's concern about job loss, but I'm with you. I'm actually not concerned about that because I think that it will free up human creativity, which will be a net benefit to society. That's pretty exciting to me. But then I think then you know, once a number of those problems are solved, you, and even now what people like to talk about more is the tricky thing. Should this LM be able to make a diagnosis for a doctor? And where is the jurisdiction, you know, from the doctor and from the patient? And how do you regulate that? And, you know, that's where things get really tricky. And that's where people tend to
Starting point is 00:08:25 focus the conversation because they're like, what's going to happen? You know, is this thing going to like wrongly diagnose your disease? But to your point way earlier, there's going to be monumental changes for things that are not that exciting, like from a news cycle standpoint, right? Yeah, 100%. Actually, I think there is a... I keep sharing these insights because, I don't know, it was very surprising for me when I experienced it. But I think it's a glimpse to the future
Starting point is 00:09:01 of how AI is going to be, in my opinion, implemented and how we are going to embed it in our everyday life. So I don't know. I guess everyone is familiar with Invisalign, right? Even if they haven't used it, they probably know what it is. If you go through the process of doing a treatment within VisaLine, you are going to notice something if you pay attention. Okay? The whole process can happen without actually having a doctor there.
Starting point is 00:09:42 Like, you don't need the doctor. Like, you don't. Even what the doctor is doing when you don't need the doctor. Like you don't. Like even what like the doctor is doing when you go there, like at the beginning, uh, it's something that like you can pretty much do like on your own. Like you scan your teeth, they take these scans with a phone, by the way, you can do it, feed it to the algorithm and the algorithm comes up with a treatment. And the treatment is then fed into a 3D printer. They print the Invisalign, the molds, and they send them to you. But no one does that.
Starting point is 00:10:16 We always go either to a dentist or an orthodentist, and we go like through this process and the reason we do that is because the modality is like so so different that like humans would never like trust that right like it will take time to get to the point where you're like i'm going to put something in my mouth for like the next six months that is completely generated by a machine and it is completely generated by a machine. And it is completely generated by a machine. The doctor there is there to take care of like, okay, some cases that the, like the edge cases of the algorithm or like complications, right? And most importantly, to make you feel safe and trust to the process.
Starting point is 00:11:01 Okay. Because you are interacting with a human, right? Yeah. Now, the benefit, did the orthodontist lost their jobs? No. Just that more and more people at the end got access to this kind of treatment in lower prices and they scaled their jobs because now they can, in their offices, they can have 5x
Starting point is 00:11:25 the patients that they had. I think that's what is going to happen with other professions that you have lawyers interacting with the public sector. All these insurance claims.
Starting point is 00:11:44 All that stuff. We will just be able like to do more. And that's going to be like, in my opinion, like a positive sum game at the end compared to the probably negative sum game that like crypto was. Yep. Yep. Yeah, that's super interesting okay one last one last thing what have you built and have you built anything with an flm yet yeah i'm using
Starting point is 00:12:18 i'm using them like a lot i mean i'm using them yeah like i I mean, I'm using them. Yeah. Sorry, beyond just a personal, because I think a lot of people who are interested in it use it for different things on a daily basis by just sort of inputting a bunch of prompts. But have you, like... I think you've built a couple end-to-end things on it, right? Yeah, I'm building stuff. Yeah, like, for example like one of the ways that like, and it's funny because you
Starting point is 00:12:48 mentioned something similar, like for example, I wanted like to build like a synthetic data generator and you know, especially when you have like low cardinality data that you need there. it can be like super helpful because it can create like much more realistic data. So for example, let's say I want to even like with, let's take like the, a rather stack event, right? Like if you want to create synthetic data for the agent field, instead of trying to figure out and putting just a junk of characters there, you can get, I don't know, a variation of like a hundred of them and just sample them and use them to generate data.
Starting point is 00:13:46 And you don't have to continuously do that. It's just to seed your data with that because it's slow to do it every time and expensive. But I'm here to do these kind of things. I'm using it to code. It's super useful,
Starting point is 00:14:02 especially... I was never the person that enjoyed writing tests, so now it's pure joy to write tests using Copilot. I'm doing a lot of... I have used Whisper a lot to analyze the episodes here that we are recording. I'm using MidJourney to generate some assets for presentations and stuff like that. By the way, keep in mind, I can't even draw a straight line. For me, it's a huge enabler that I can go there and create something decent. It's not like it's going to be perfect. It will never be perfect. And yeah, I will need a designer. I will never substitute the designer. But at the end, the designer can iterate much more on what I'm asking because of these tools.
Starting point is 00:15:08 So we can end up in what I need faster. What else have I done? I think these are the things so far. I'm using it a lot while I code, to be honest. It's very interesting. It cannot substitute coding but it changes the experience a lot what has changed is that I'm not using Stack Overflow anymore I mean I'm not writing code in the same way that you are
Starting point is 00:15:36 but querying data a lot I've never had a job where one of my stated responsibilities was writing SQL or Regex stuff. And so if you've never done that professionally, and I just haven't spent the time to actually get over the hump of figuring out how to structure a query in a way that is efficient and whatever, it is amazing
Starting point is 00:16:05 for that where it's like i'm trying to answer a question and i know like the basic structure of how it should look and it's like just corrects this yeah it's amazing yeah the other thing that's also like connects with a conversation about like google for example, they're like, okay, I've never learned or there are specific terminal commands that I keep forgetting exactly what the parameters are. Yeah. One of the biggest problems that I have is I always forget how to do an SCP.
Starting point is 00:16:40 And I usually just go and Google it because you pretty much get the exact answer when you want. But what happens is like people have like games assistant show maths where you are like ending like in a landing page where there is the command. And there's show maths advertisement on it. Like show me like it's so annoying. Like I can't see anything. I know exactly what you're talking about. Yeah, so
Starting point is 00:17:06 now, instead of doing that, I'm just asking the LLM and it gives me the commands. Even if it doesn't work, it's close to what I need, right? But I didn't have to go through all that crap of all these stupid ads. I totally agree. I think also, you know,
Starting point is 00:17:29 Google in a way also heard themselves by allowing this to happen. Because at the end, it's all about the experience. It's the same information. There's nothing creative there. Totally agree, yeah. It's just the experience're really bad on Google. Yeah, it's an absolutely fascinating study in user interface.
Starting point is 00:17:56 I totally agree in accessing information. It's also going to be really interesting to see how this impacts Microsoft and kind of what that does in the market with the big players, which will be super fun. All right. Well, Brooks is telling us we are at time here. And maybe we'll try to, I don't know, we should
Starting point is 00:18:22 try to ask an MLM to generate an entire episode around a topic and see if they get our tone and voice? That would be interesting. We can do some, we can be creative. Let's have like a sub GPT as a guest. Yes. And see what happens. That would be fun.
Starting point is 00:18:43 Let's do that. All right, we'll try it. All right. Well, thanks everyone for joining us for another shop talk. Costas, thank you for sharing all of your thoughts, even if they're, you know, just a construct and we will catch you on the next one. You know, Costas, we learned so much from the data leaders that we talked to, but I learned so much from picking your brain and actually your questions really make me think really hard. So I appreciate Shop Talk. I think it makes me a sharper thinker.
Starting point is 00:19:13 Well, it's fun. Like, I think it's good to just sit and chat about the stuff that we experience. And yeah, I think like I hope like people enjoy it. That's why I'll keep asking for people to reach out.
Starting point is 00:19:29 Please do this. Come on, Pog. Like you can do that. Like send an email. Yeah. Let us know how you feel and like, what are your opinions of like your experience with the show? So please do that.
Starting point is 00:19:43 So me and Derek, we can keep being happy. Of course. And of course, we try to take the same types of questions to data leaders from all sorts of companies, large and small. So definitely subscribe to the main show if you haven't yet.
Starting point is 00:20:02 Tons of really good episodes there and tons of really good thoughts from data leaders, you know, really around the world. So definitely subscribe if you haven't and we'll catch you on the next Shop Talk.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.