Screaming in the Cloud - Taking a Hybrid AI Approach to Security at Snyk with Randall Degges
Episode Date: November 29, 2023

Randall Degges, Head of Developer Relations & Community at Snyk, joins Corey on Screaming in the Cloud to discuss Snyk's innovative AI strategy and why developers don't need to be afraid of security. Randall explains the difference between Large Language Models and Symbolic AI, and how combining those two approaches creates more accurate security tooling. Corey and Randall also discuss the FUD phenomenon in selling security tools, and Randall expands on why Snyk doesn't take that approach. Randall also shares some background on how he went from being a happy Snyk user to a full-time Snyk employee.

About Randall

Randall runs Developer Relations & Community at Snyk, where he works on security research, development, and education. In his spare time, Randall writes articles and gives talks advocating for security best practices. Randall also builds and contributes to various open-source security tools. Randall's realms of expertise include Python, JavaScript, and Go development, web security, cryptography, and infrastructure security. Randall has been writing software for over 20 years and has built a number of popular API services and open-source tools.

Links Referenced:
Snyk: https://snyk.io/
Snyk blog: https://snyk.io/blog/
Transcript
Hello, and welcome to Screaming in the Cloud, with your host, Chief Cloud Economist at the
Duckbill Group, Corey Quinn.
This weekly show features conversations with people doing interesting work in the world
of cloud, thoughtful commentary on the state of the technical world, and ridiculous titles
for which Corey refuses to apologize.
This is Screaming in the Cloud.
Welcome to Screaming in the Cloud. I'm Corey Quinn, and this featured guest episode is brought
to us by our friends at Snyk. Also brought to us by our friends at Snyk is one of our friends at
Snyk, specifically Randall Degges, their
head of developer relations and community. Randall, thank you for joining me.
Hey, what's up, Corey? Yeah, thanks for having me on the show, man. Looking forward to talking
about some fun security stuff today.
It's been a while since I got to really talk about a security-centric thing on this show,
at least in order of recordings. I don't know if the one right before this is a security thing. Things happen on the back end that I am blissfully unaware of.
But it seems the theme lately has been a lot around generative AI. So I'm going to start off
by basically putting you in the hot seat. Because when you pull up a company's website these days,
the odds are terrific that they're going to have completely repositioned absolutely everything that they do in the context of generative AI.
It's like, we're a generative AI company. It's like, that's great. Historically, I have been
a paying customer of Snyk so that it does security stuff. So if you're now a generative AI company,
who do I use for the security platform thing that I was depending upon? You have not done that.
First, good work. Secondly,
why haven't you done that? Great question. Also, you said a moment ago that LLMs are very
interesting or there's a lot of hype around it. Understatement of the last year, for sure.
Oh my god, it has gotten brutal. I don't know how many billions of dollars have been dumped into
LLMs in the last 12 months, but I'm sure it's a very high number.
I have the sneaking suspicion that the largest models cost at least a billion each to train,
just based upon, at least retail price, based upon the simple economics of how long it takes
to do these things, how expensive that particular flavor of compute is. And the technology is magic.
It is magic in a box.
And I see that.
But finding ways that it applies in different ways
is taking some time,
but that's not stopping the hype beasts.
A lot of the same terrible people
who were relentlessly pushing crypto
have now pivoted to relentlessly pushing generative AI,
presumably because they're working through
NVIDIA's street team or their referral program
or whatever it is. It doesn't matter what the rest of us do as long as we're burning GPU cycles on it.
And I want to distance myself from that exciting level of boosterism. But it's also magic.
Yeah. Well, let's just talk about AI and security for a moment and answer your previous question.
So what's happening in the space? What's the deal?
Where's all the hype going? And what is Snyk doing around there? So quite frankly,
and I'm sure a lot of people on your show say the same thing, but Snyk isn't new into the AI space.
It's been a fundamental part of our platform for many years now. So for those of you listening who
have no idea what the heck Snyk is, and you're like, why are we talking about this? Snyk is essentially a developer security company.
And the core of what we do is two things.
The first thing is we help scan your code, your dependencies, your containers, all the
different parts of your application and detect vulnerabilities.
That's the first part.
The second thing we do is we help fix those vulnerabilities.
So detection and remediation.
Those are the two components of any good security tool or security company.
And in our particular case, we're very focused on developers because our whole product is really based on your application and your application security, not infrastructure and other things like this.
So with that being said, what are we doing at a high level with LLMs? Well, if you think about AI as a broad spectrum,
you have a lot of different technologies behind the scenes that people refer to as AI.
You have lots of these large language models,
which are generating text based on inputs.
You also have symbolic AI,
which has been around for a very long time,
and which is very domain-specific.
It's like creating specific rules
and helping you pattern detection
amongst things. And those two different types of applied AI, let's say we have large language
models and symbolic AI are the two main things that have been happening in the industry for the
last tens of years, really, with LLMs being the new kid on the block. So when we're talking about
security, what's important to know about just those two underlying technologies?
Well, the first thing is that large language models, as I'm sure everyone listening to this knows, are really good at predicting text. Their creators downloaded portions of the internet, tons of data, classified it, and then trained their models on top of this data so that they can help predict
the things that people are putting into chat. And that's why they're so interesting and powerful,
and there's always cool use cases popping up with them. However, the downside of LLMs is because
they're just using a bunch of training data behind the scenes, there's a ton of room for
things to be wrong.
Training data sets aren't perfect.
They're coming from a ton of places.
And even if they were perfect,
there's still the likelihood that anything generating output
based on a statistical model isn't going to be accurate,
which is the whole concept of hallucinations.
Right.
I wound up remarking on the live stream for GitHub Universe a week or two ago that the S in AI stood for security.
One of the problems I've seen with it is that it can generate a very plausible looking IAM policy if you ask it to, but it doesn't actually do what you think it would if you go ahead and actually use it.
I think that it's still squarely in the realm of it's great at creativity,
it's great at surface level knowledge. But for anything important, you really want someone who
knows what they're doing to take a look at it and say, I saw your role there, Spasty Pudding.
100%. And when we're talking about LLMs, I mean, you're right. Security isn't really what they're
designed to do, first of all. They're designed to predict things based on statistics, which is not a security concept. But secondly, another important thing to know is when you're
talking about using LLMs in general, there's so many tricks and techniques and things you can do
to improve accuracy. For example, having a ton of context, or doing
few-shot learning techniques where you prompt it and give it examples of the questions and answers
you're looking for, can give you a slight competitive edge there in terms of reducing hallucinations and false information. But fundamentally,
LLMs will always have a problem with hallucinations and getting things wrong.
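To make that few-shot idea concrete, here is a minimal sketch using the openai Python client (v1+); the model name and example pairs are illustrative placeholders, not anything Snyk-specific:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Few-shot prompting: seed the conversation with example Q/A pairs so
# the model mimics the format and judgment we want before it answers.
response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[
        {"role": "system", "content": "You review code snippets for security bugs."},
        # Example pair: demonstrates the expected style of answer.
        {"role": "user", "content": 'cursor.execute(f"SELECT * FROM users WHERE id={uid}")'},
        {"role": "assistant", "content": "SQL injection: user input is interpolated into the query. Use parameterized queries."},
        # The real question comes last.
        {"role": "user", "content": "subprocess.run(cmd, shell=True)"},
    ],
)
print(response.choices[0].message.content)
```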
So that brings us to what we mentioned before, symbolic AI and what the differences are there.
Well, symbolic AI is a completely different approach. You're not taking huge training
sets and using machine learning to build statistical models.
It's very different.
You're creating rules
and you're parsing very specific
domain information
to generate things that are highly accurate.
Although those models will fail
when applied to general purpose things,
unlike large language models.
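For a concrete flavor of that symbolic approach, here is a toy rule written in Python with the standard ast module. Real engines like Snyk Code use far richer rule sets and data-flow analysis; this only illustrates the explicit-rules-instead-of-statistics idea:

```python
import ast

RULE = "do not pass user-controlled data to eval()"

def check(source: str) -> list[str]:
    # Walk the syntax tree and flag every call to the builtin eval().
    findings = []
    for node in ast.walk(ast.parse(source)):
        if (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id == "eval"):
            findings.append(f"line {node.lineno}: {RULE}")
    return findings

print(check("x = eval(input())"))  # -> ["line 1: do not pass user-controlled data to eval()"]
```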
So what does that mean?
You have these two different types of AI
that people are using.
You have symbolic AI,
which is very specific and requires a lot of expertise to create. Then you have LLMs, which take a lot of experience to create as well, but are general purpose. And I think, fundamentally, one of the things that separates Snyk from a lot of other companies in the space is we're just trying to do whatever the best technical solution is to solve the problem.
And I think we found that with our hybrid approach.
I think that there is a reasonable distrust of AI when it comes to security. I mean, I wound up recently using it to build what will have been
announced by the time this thing airs, which is my re:Invent Photo Scavenger Hunt app. I know
nothing about front end, so that's okay. I've got a robot in my pocket. It's great at doing
the development of the initial thing, and then you have issues and you want to add functionality.
And it feels like by the time I was done with my first
draft, that 10 different engineers had all collaborated on this thing without ever speaking
to one another. There was no consistent idiomatic style. It used a variety, a hodgepodge of different
lists and the rest. It became a bit of a Frankenstein's monster. That can kind of work
if we're talking about a web app that doesn't have any sensitive data in it.
But holy crap, the idea of applying that to... "Yeah, that's how we built our bank security policy" is one of those let-me-know-who-said-that-so-they-can-not-have-their-job-anymore territories
when the CISO starts hunting. You're right. It's a very tenuous situation to be in from
a security perspective. The way I like to think about it, because I've been a developer for a
long time and a security professional. And I, as much as anyone out there, love to jump on the hype train for things and do whatever I can to be lazy and just get work done quicker.
And so I use ChatGPT, I use GitHub Copilot, I use all sorts of LLM-based tools to help me write software.
And similarly to the problems when developers are not using LLMs to help them write code,
security is always a concern.
It doesn't matter if you have a developer
writing every line of code themselves
or if they're getting help from Copilot or ChatGPT.
Fundamentally, the problem with security
and the reason why it's such an annoying part
of the developer experience, in all honesty,
is that security is really difficult.
You can take someone who's an amazing engineer
who has 30 years of experience.
You could take John Carmack, I'm sure.
One of the most legendary developers
to ever walk the earth.
You could sit over his shoulder
and watch him write software.
I can almost guarantee you
that he's going to have
some sort of security problem in his code,
even with all the knowledge he has in his head.
And part of the reason that's the case
is because modern security is way complicated.
Like if you're building a web app, you have front end stuff you need to protect, you have back end
stuff you need to protect, there's databases and infrastructure and communication layers
between the infrastructure and the services. It's just too complicated for one person to fully
grasp. And so what do you do? Well, you basically need some sort of assistance from automation.
You have to have some sort of tooling that can take a look at your code that you're writing
and say, hey, Randall, on line 39, when you were writing this function that's taking user
data and doing something with it, you forgot to sanitize the user data.
Now, that's a simple example.
But let's talk about a more complex example.
Maybe you're building some authentication software and you're taking users' passwords
and you're hashing them using a common hashing algorithm.
And maybe the tooling is able to detect: well, hey, you're using the bcrypt password hashing algorithm
with a work factor of 10 to create this password hash.
But guess what?
We're in 2023.
And a work factor of 10 is something that older commodity CPUs can now crack at a reasonable
rate.
And so you need to bump that up to 13 or 14.
These are the types of things where you need help over time.
It's not something that anyone can reasonably assume they can just deal with in their head.
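As a rough illustration of that work-factor point, here is a minimal sketch using the Python bcrypt package; the choice of 14 rounds is illustrative, not an official recommendation:

```python
import bcrypt

password = b"correct horse battery staple"

# rounds is bcrypt's work factor, a log2 cost: each +1 doubles the
# hashing time, so rounds=14 is 16x more work than rounds=10.
hashed = bcrypt.hashpw(password, bcrypt.gensalt(rounds=14))

# Verification re-derives the hash using the cost stored inside it,
# so old hashes keep verifying while new ones get the higher factor.
assert bcrypt.checkpw(password, hashed)
```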
The way I like to think about it is as a developer,
regardless of how you're building code,
you need some sort of security checks on there
to just help you be productive in all honesty.
If you're not doing that, you're just asking for problems.
Oh yeah, on some level, even the idea of
it's just going to be very computationally expensive
to wind up figuring out what that password hash is.
Well, great.
One of the things that we've been aware of for a while
is that given the rise of botnets and compromised computers,
the attackers have what amounts to
infinite computing capacity, give or take.
So if they want in on some level badly enough, they're going to find a way to get in there. When you say that every developer is going to sit down and write insecure code, you're right.
And a big part of that is because, as imagined today, security is an incredibly high friction
process. And it's not helped, frankly, by tools that don't have nuance or understanding. If I
want to do a crap ton of busy work that doesn't feel like it moves the needle forward at all,
I'll go around to resolving the hundreds upon hundreds of dependabot alerts I have for a lot
of my internal services that write my weekly newsletter, because some dependency three deep
winds up having a failure mode when it gets untrusted input of the following
type. It can cause resource exhaustion. It runs in a Lambda function, so I don't care about the
resources. And two, I'm not here providing the stuff that I write, which is the input,
with an idea toward exploiting stuff. So it's busy work, things I don't need to be aware of.
But more to the point, stuff like that has the high propensity to mask things
I actually do care about. Getting the signal from noise from your misconfigured, ill-conceived
alerting system is just awful. A bad thing is there are no security things for you to work on,
but a worse one is here are 70,000 security things for you to work on.
How do you triage? How do you think about it? 100%. I mean, that's actually the most difficult thing I would say that security teams have to
deal with in the real world. It's not having a tool to help detect issues or trying to get people
to fix them. The real issue is, there's always security problems, like you said, right? Like,
if you take a look and just scan any code base out there, any reasonably sized code base,
you're going to find a ridiculous amount of issues.
Some of those issues will be actual issues, like you're not doing something in code hygiene
that you need to do to protect stuff.
A lot of those issues are meaningless things.
Like you said, you have a transitive dependency
that some direct dependency is referring to.
And maybe in some function call, there's an issue there
and it's alerting you on
it, even though you don't even use this function call. You're not even touching this class or this
method or whatever it is. And it wastes a lot of time. And that's why the holy grail in the security
industry, in all honesty, is prioritization and insights. At Snyk, we pioneered this concept of
ASPM, which stands for Application Security Posture Management.
And fundamentally, what that means is when you're a security team, and you're scanning code and
finding all these issues, how do you prioritize them? Well, there's a couple approaches. One
approach is to use static analysis to try to figure out if these issues that are being detected
are reachable, right? Like, can they be achieved in some way?
But that's really hard to do statically. And there's so many variables that go into it that no one really has foolproof solutions there. The second thing you can do is you can combine
insights and heuristics from a lot of different places. So you can take a look at static code
analysis results, and you can combine them with agents running live that are observing your application.
And then you can try to determine what stuff is actually reachable, given this real-world
heuristic and real-time information, and mapping it up with static code analysis results.
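As a toy sketch of that prioritization idea: intersect static findings with the functions a runtime agent actually observed executing, then rank what's left. The names and data shapes here are hypothetical, not Snyk's ASPM API:

```python
from dataclasses import dataclass

@dataclass
class Finding:
    function: str   # fully qualified function the issue lives in
    severity: int   # higher = worse

def prioritize(static_findings: list[Finding],
               observed_functions: set[str]) -> list[Finding]:
    # Keep only issues in code the agent saw run, worst first.
    reachable = [f for f in static_findings if f.function in observed_functions]
    return sorted(reachable, key=lambda f: f.severity, reverse=True)

# Only the finding in code that actually executed gets surfaced.
findings = [Finding("app.views.login", 9), Finding("vendor.unused.parse", 7)]
print(prioritize(findings, observed_functions={"app.views.login"}))
```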
And that's really the holy grail of figuring things out. We have an ASPM product, or maybe
it's a feature and offering, if you will. But it's something that, I think, gives security admins a lot more insight into that type of operation at their
business. But you're totally right, Corey, it's a really difficult problem to solve.
And it burns a lot of goodwill in the security community and in the industry because people
spend a lot of time getting false alerts, going through stuff and just wasting millions of hours
a year, I'm sure.
That's part of the challenge, too, is that it feels like there are two classes of problem in the world, at least when it comes to business. And I found this by being on the wrong side of
it on some level. Here on the wrong side, it's things like caring about cost optimization. It's
caring about security. It's remembering to buy fire insurance for your building. You can wind
up doing all of those things, and you should be doing them, but you can
over-index on them to the point where you run out of money and your business dies.
The proactive side of that fence is getting features to market sooner, increasing market
share, growing revenue, et cetera.
And that's the stuff that people are always going to prioritize over the back burner stuff.
So striking a balance between that is always going to be a bit of a challenge.
And where people land on that is going to be tricky. So I think this is a really good bridge.
You're totally right. It's expensive to waste people's time, basically, is what you're saying,
right? You don't want to waste people's time. You want to give them actionable alerts that
they can actually fix or hopefully fix it for them if you can, right?
So I'm going to lay something out, which, in our opinion, is the Snyk way, if you will,
that you should be approaching these developer security issues.
So let's take a look at two different approaches. The first approach is going to be using an LLM, let's say just ChatGPT. We'll call them out because everyone knows ChatGPT.
The first approach we're going to take is using...
Although I do insist on pronouncing it ChatGipity, but please continue.
I love that. I haven't heard that before.
ChatGipity. Sounds so much more fun, you know?
It sounds more personable. Yeah.
So you're talking to ChatGipity. Thank you.
And you paste in a file from your codebase and you say,
Hey, ChatGipity, here's a file from my codebase.
Please help me identify security issues in here. And you get back a long list of recommendations.
Well, it does more than that. Let me just interject there, because one of the things it does that I
think very few security engineers have mastered is it does it politely and constructively,
as opposed to having an unstated tone of, you dumbass, which I've tested with prompts on this.
You can get it to have a condescending,
passive-aggressive tone,
but you have to go out of your way to do it
as opposed to it being the default.
Please continue.
Great points.
Also, Daniel from Unsupervised Learning, by the way,
has a really good post where he shows you
setting up ChatGipity to mimic Scarlett Johansson
from the movie Her on your phone
so you can talk to it.
Absolutely beautiful.
And you get these really fun, very nice responses
back and forth around your code analysis.
So shout out there.
But going back to the point.
So if you get these responses back from ChatGipity
and it's like, hey, look, here's all these security issues.
A lot of those things will be false alerts.
And there's been a lot of public security research done
on these analysis tools to just give you information.
Some things will be things that maybe they're a real problem
but cannot be fixed due to transitive dependencies
or whatever the issues are.
But there's a lot of things you need to do there.
Now, let's take it up one notch.
Let's say instead of using ChatGipity directly,
you're using GitHub Copilot. Now, this is a much better situation for working with code because now what Microsoft is doing is let's say you're running Copilot inside of VS Code. It's able to analyze all the files in your code base, and it's able to use that additional context to help provide you with better information. So you can talk to GitHub Copilot and say, hey, I'd really like to know what security issues are in this file. And it's going
to give you maybe a little bit better answers than ChatGPT directly, because it has more context
about the other parts of your code base and can give you slightly better answers. However, because
these things are LLMs, you're still going to run into issues with accuracy and hallucinations and
all sorts of other problems. So what is a better approach? And I think that's fundamentally what people want to know. What is a good approach
here? And on the scanning side, the right approach in my mind is using something very domain specific.
Now, what we do at Snyk is we have a symbolic AI scanning engine. So we take customers' code and
we take an entire code base. So you have access to all the files and dependencies and things like this. And you take a look at these things. And we have a security analyst team that
analyzes real world security issues and fixes that have been validated. So we do this by pulling lots
of open source projects as well as just other security information that we originally produced.
And we define very specific rules so that we can take a look at software and we can take a look at these code bases
with a very high degree of certainty
and we can give you a very actionable list
of security issues that you need to address.
And not only that,
we can show you what is going to be
the best way to address them.
So with that being said,
I think the second side to that is,
okay, if that's a better approach
on the scanning side,
maybe you shouldn't be using LLMs for finding issues. Maybe you should be using them for fixing security issues, which makes a lot of sense. So you take those accurate scan results, in combination with your code base, and fire off a request to an LLM and say, hey, ChatGipity, please take this code base
and take this security information that we know is accurate
and fix this code for me.
So now you're going one step further.
One challenge that I've seen,
especially as I've been building weird software projects
with the help of magic robots from the future,
is that a lot of components like in React, for example,
get broken out into their
own file. And pasting a file in is all well and good, but very often it needs insight into the
rest of the code base. At GitHub Universe, something that they announced was Copilot
Enterprise, which trains Copilot on the intricacies of your internal structures around shared
libraries, all of your code, etc. And in some of the companies I'm familiar with, I really believe that's giving a very expensive smart robot a form of brain
damage. But that's neither here nor there. But there's an idea of seeing the interplay between
different components that individual analysis on a per-file basis will miss feels to me like
something that needs a more holistic view. Am I wrong on that? Am I oversimplifying?
You're right.
There's two things we need to address.
First of all, let's say you have the entire application context,
so all the files, right?
And then you ask an LLM to create a fix for you.
This is something we do at Snyk.
We actually use LLMs for this purpose.
So we take this information, we ask the LLM,
hey, please rewrite this section of code that we know has an issue,
given this security information to remove this problem. The problem then becomes, okay, well,
how do you know this fix is accurate and it's not going to break people's stuff? And that's where
symbolic AI becomes useful again. Because again, what is the use case for symbolic AI? It's taking
very specific domains of things that you've created very specific rule sets for and using
them to validate things or to pass arbitrary checks
and things like that.
And it's a perfect use case for this.
So what we actually do
with our autofix product,
so if you're using VS Code
and you have Copilot, right?
And Copilot's spitting out software.
As long as you have
Snyk in the IDE too,
we're actually taking a look
at those lines of code
Copilot just inserted.
And a lot of the time,
we are helping you rewrite that code
to be secure using our LLM stuff.
But then as soon as we get that fix created, we actually run it through our symbolic engine.
And if we're saying no, it's actually not fixed, then we go back to the LLM and we
reprompt it over and over again until we get a working solution.
And that's essentially how we create a much more sophisticated iteration, if you will,
of using AI to really help improve code quality.
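Here is a minimal sketch of that generate-then-verify loop; llm_rewrite and symbolic_engine_verifies are hypothetical stand-ins for the internal components described here, with toy stubs only so the control flow runs:

```python
def llm_rewrite(code: str, issue: str, feedback: str) -> str:
    # Stand-in for an LLM call that proposes a fixed version of the code.
    return code.replace("eval(", "ast.literal_eval(")

def symbolic_engine_verifies(code: str, issue: str) -> tuple[bool, str]:
    # Stand-in for the deterministic rule engine that validates the fix.
    ok = "eval(" not in code.replace("literal_eval(", "")
    return ok, "" if ok else "eval() is still reachable"

def autofix(vulnerable_code: str, issue: str, max_attempts: int = 5) -> str | None:
    feedback = ""
    for _ in range(max_attempts):
        candidate = llm_rewrite(vulnerable_code, issue, feedback)  # propose
        ok, feedback = symbolic_engine_verifies(candidate, issue)  # validate
        if ok:
            return candidate  # verified fix, safe to suggest
    return None  # give up and surface the issue to a human instead

print(autofix("x = eval(raw)", "code injection via eval"))
```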
But all that being said, you still had a good point,
which is maybe if you're using the context from the application
and people aren't doing things properly,
how does that impact what LLMs are generating for you?
And an interesting thing to note is that our security team internally here
just conducted a really interesting project.
And I would be angry at myself if I didn't explain it because I think it's a very cool concept.
Oh, please.
I'm a big fan of hearing what people get up to with these things in ways that is real world stories, not trying to sell me anything, or also not dunking on what I saw on the top of Hacker News the other day, which is if all you're building is something that talks to ChatGipity's API,
does some custom prompting and returns a response,
you shouldn't be building it.
I'm like, well, I've built some things that do exactly that,
but I'm also not trying to raise $6 million in seed money
to go and productize it.
I'm just hoping someone does it better eventually,
but I want to use it today.
Please tell me a real-world story about something that you've done. Okay, so here's what we did. We went out and we found a bunch of
GitHub projects. And we tried to analyze them ourselves using a bunch of different tools,
including human verification, and basically give it a grade and say, okay, this project here
has really good security hygiene. There's not a lot of issues in the code. Things are written
in a nice way. The style and formatting is consistent. The dependencies are up to date, etc.
Then we take a look at multiple GitHub repos that are the opposite of that. Maybe projects
that haven't been maintained in a long time were written in a completely different style where you
have bad hygienic practices. Maybe you have hard-coded secrets. Maybe you have unsanitized
input coming from a user or something, right? But you take all these things. So we have these
known examples of good and bad projects. So what do we do? Well, we open them up in VS Code,
and we basically got GitHub Copilot. We said, okay, what we're going to do is use each of
these codebases. We're going to try to add features into the projects one at a time.
And what we did is we took a look at the suggested output
that Copilot was giving us in each of these cases.
And the interesting thing is that,
and I think this is super important to understand about LLMs, right?
But the interesting thing is,
if we were adding features to a project that has good security hygiene,
the types of code that we were able to get out of LLMs,
like GitHub Copilot, was pretty good.
There weren't a ton of issues with it.
The actual security hygiene was fairly good.
However, for projects where there were existing issues,
it was the opposite.
We'd get AI recommendations showing us how to write things insecurely
or potentially write things with hard-coded secrets in it.
And this is something that's very reproducible today in, you know, what is it right now?
Middle of November 2023.
Now, is it going to be this case a year from now?
I don't necessarily know.
But right now, this is still a massive problem.
So that really reinforces the idea that
not only when you're talking about LLMs
is the training set they use to build the models important,
but also the context in which you're using them is incredibly important.
It's very easy to mislead LLMs.
Another example of this, if you think about the security scanning concept we talked about
earlier, imagine you're talking to ChatGipity and you're pasting in a Python function.
And the Python function is called Completely Safe Not Vulnerable Function.
That's the function name.
And inside of that function, you're backdooring some software.
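A hypothetical version of that function might look like the following; the name loudly claims safety while the body is a textbook command-injection backdoor:

```python
import subprocess

def completely_safe_not_vulnerable_function(user_input: str) -> str:
    # Despite the reassuring name, this hands an arbitrary string to a
    # shell with no validation whatsoever.
    result = subprocess.run(user_input, shell=True,
                            capture_output=True, text=True)
    return result.stdout
```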
Well, if you ask ChatGipity multiple times, with the temperature set to 1.0, hey, is this code safe?
Sometimes you'll get the answer yes, because the context within the request, that function name saying this is not a vulnerable function or whatever you want to call it, can mislead the LLM output and result in problems. It's just like classic prompt
injection type issues. But there's a lot of these types of vulnerabilities still hidden in plain
sight that impact all of us. And so it's so important to know that you can't just rely on
one thing. You have to have multiple layers, something that helps you with things, but also something that is helping you fix things when needed. I think that's the key that gets
missed a lot is the idea of, it's not just what's here, what have you put here that shouldn't be,
what have you forgotten? There's a different side of it. It's easy to do a static analysis and say,
oh, you're not sanitizing your input on this particular form. Great.
Okay.
Well, I say it's easy.
I wish more people would do that.
But then there's also a step beyond of what is it that someone who has expertise, who's been down this road before, would take one look at your code base and say, are you making this particular misconfiguration or common misstep?
Yeah, it's incredibly important.
Like I said, security is just one of those things where it's really broad.
I've been working in security for a very long time, and I make security mistakes all the
time myself. In your developer environment right now, you ran this against the production
environment, didn't get permissions errors. That is suspicious. Tell me
more about your authentication pattern. Right. I mean, there's just
a ton of issues that can cause problems.
And it's...
Yeah, it is what it is, right?
Software security is something difficult to achieve.
If it wasn't difficult, everyone would be doing it.
Now, if you want to talk about vision for the future,
actually, I think there's some really interesting things
with the direction I see things going.
A lot of people have been leaning
into the whole AI autonomous agents thing over the last year.
People started out by taking LLMs and saying,
Okay, I can get it to spit out code, I can get it to spit out this and that.
But then you go one step further and say,
All right, can I get it to write code for me and execute that code?
And OpenAI, to their credit, has done a really good job advancing some of the capabilities here,
as well as a lot of open source frameworks.
You have LangChain and BabyAGI and AutoGPT and all these different things that make this more feasible
to give AI access to actually do real meaningful things. And I can absolutely imagine a world in
the future, maybe it's a couple years from now, where you have developers writing software.
And it could be a real developer, it could be an autonomous agent, whatever it is.
And then you also have agents
that are taking a look at your software
and rewriting it to solve security issues.
And I think when people talk about autonomous agents,
a lot of the time they're purely focusing on LLMs.
I think it's a big mistake.
I think one of the most important things you can do
is focus on the very niche symbolic AI engines
that are going to be needed to guarantee accuracy
with these things. And that's why I think the Snyk approach is really cool. We dedicated a
huge amount of resources to security analysts building these very in-depth rule sets that are
guaranteeing accuracy on results. And I think that's something that the industry is going to
shift towards more in the future as LLMs become more popular, which is, hey, you have all these great tools doing all sorts of cool stuff.
Now let's clean up and make it accurate.
And I think that's where we're headed in the next couple of years.
I really hope you're right.
I think it's exciting times.
But I also am leery when companies go too far into boosterism,
where robots are going to do all of these things for us.
Maybe, but even if you're right, you sound psychotic.
And that's something that I think gets missed
in an awful lot of the marketing
that is so breathless with anticipation.
I have to congratulate you folks
on not getting that draped
all over your message once again.
My other favorite part of your messaging
when you pull up snyk.com,
sorry, snyk.io.
What is it these days?
It's the.io, isn't it?
.io, it's hot.
.io, yes.
It's still hot, you know?
I feel like I'm turning into a boomer here
where the internet is .com.
It doesn't necessarily work that way.
But no, what I love is the part
where you have this fear-based marketing
of if you wind up not using our product,
here are all the terrible things that'll happen.
And my favorite part about that marketing
is it doesn't freaking exist.
It is such a refreshing departure
from so much of the security industry,
where it does the fear, uncertainty, and doubt nonsense stuff
that I love that you don't even hint in that direction.
My actual favorite thing that is on your page, of course, is at the bottom.
If you mouse over the dog in the logo at the bottom of the page,
it does the quizzical tilting head thing.
And I just think that is spectacular.
So the Snyk mascot, his name is Patch. He's a Doberman and everyone loves him.
But yeah, you're totally right. The FUD thing is a real issue in security. Fear,
uncertainty, and doubt. It's the way security companies sell products to people.
And I think it's a real shame. I give a lot of tech talks at programming conferences in particular
around security and cryptography. And one of the things I always
start out with when I'm giving a tech talk about any sort of security or cryptography topic is I
say, okay, how many of you have landed in a Stack Overflow thread where you're talking about a
security topic and someone replies and says, oh, a professional should be doing this, you shouldn't
be doing it yourself. That comes up all the time when you're looking up security topics on the
internet. Then I ask people, how many of you feel like security is this sort of obscure, mystical arts that requires a lot of
expertise in math knowledge and all this stuff? And a lot of people sort of have that impression.
The reality, though, is security, and to the same extent, cryptography, is just like any other part
of computer science. It's something that you can learn. There's best practices. It's not rocket science. Maybe it is if you're developing a brand
new hashing algorithm from scratch. Yes, leave that to the professionals. But using these things
is something everyone needs to understand well. And there's tons of material out there explaining
how to do things right. And you don't need to be afraid of this stuff. And so I think
a big part of the Snyk message is,
we just want to help developers just make their code better.
And what is one way that you're going to do a better job at work?
Get more of your code through the PR review process? What is a way you're going to get more features out?
A big part of that is just building things right from the start.
And so that's really our focus and our message is,
hey, developers, we want to be like a trusted partner
to help you build things faster and better.
It's nice to see it just because there's so much
that just doesn't work out the way that we otherwise hope it would.
And historically, there's been a tremendous problem
of differentiation in the security space.
I often remark that at RSA, there's about 12 companies exhibiting.
Now, sure, there are hundreds of booths,
but it's basically the same 12 things.
There's the entire row of firewalls
where they use different logos
and different marketing words on the slides,
but they're all selling fundamentally the same thing.
One of the things I've always appreciated about Snyk
is that it's never felt that way.
Well, thanks.
Yeah, we appreciate that.
I mean, our whole focus is just developer security.
What can we do to help developers build things securely? I mean, you are sponsoring this episode,
let's be clear. But also, we are paying customers of you folks. And that is not,
those things are not related in any way. What is the line that we like to use that we stole
from the RedMonk folks? You can buy our attention, but not our opinion. And our opinion of what you
folks are up to has been stratospherically
high for a long time.
I certainly appreciate that as a Snyk employee
who is also a happy user of the service.
The way I actually ended up working at Snyk
was I've been using the product for my
open source projects for years.
And I legitimately really liked it.
And I thought this was cool.
And eventually ended up working here because
there was a position and a friend reached out to me and stuff.
But I am a genuinely happy user
and just like the goal and the mission.
We want to make developers' lives better.
And so it's super important.
I really want to thank you for taking the time
to speak with me about all this.
If people want to learn more,
where's the best place for them to go?
Yeah, thanks for having me.
If you want to learn more about AI or just developer security in general, go to snyk.io.
That's S-N-Y-K, in case it's not clear, dot io.
In particular, I would actually go check out our Snyk Learn platform, which is linked
to from our main site.
We have tons of free security lessons on there showing you all sorts of really cool things.
If you check out our blog,
my team and I in particular also
do a ton of writing on there about a lot of these
bleeding edge topics. And so if you want to keep up
with cool research in the security space like this,
just check it out. Give it a read. Subscribe to the
RSS feed if you want to. It's fun.
And we will put links to that in the
show notes. Thanks once again for
your support and of course, putting up with my
slings and arrows. And thanks for
having me on and thanks for using Snyk too.
We love you.
Randall Degges, Head
of Developer Relations and Community at
Snyk. This featured guest episode has been
brought to us by our friends at Snyk
and I'm Corey Quinn. If you've enjoyed
this episode, please leave a five-star review
on your podcast platform of choice.
Whereas if you've hated this episode, please leave a five-star review on your podcast platform of choice,
along with an angry comment that I will get to reading immediately. You can get me to read it
even faster if you make sure your username is set to Dependabot. If your AWS bill keeps rising
and your blood pressure is doing the same, then you need the Duckbill Group.
We help companies fix their AWS bill by making it smaller and less horrifying.
The Duckbill Group works for you, not AWS.
We tailor recommendations to your business and we get to the point.
Visit duckbillgroup.com to get started.