How I AI - How Intercom 2x’d their engineering velocity in 9 months with Claude Code | Brian Scanlan

Starting point is 00:00:00 Suddenly you started realizing that you have to think bigger about things or that your imagination is now the barrier, not the tool. How is this not happening in your organization? Like literally the physical limits of my ability to type code are unlocked by AI. Today we are seeing twice the number of throughput as we did compared to nine months ago on our engineering team. Now it's like, why can't it be 10x? This is a little bit more of what my instinct tells me as possible, which is if you go all in, if you prepare your team, if you prepare your code base, I think you're overall. product quality is going to go up. I think your overall developer experience is going up.

Starting point is 00:00:33 There's just so many good things that come out of using these tools and using them correctly. Backlog zero is a realistic thing for teams to be able to go after. All the things that you wish you would ever want it to do, it's now just achievable. I often advise a lot of CTOs and VPs of engineering when figuring out how to get their engineering team AI-pilled. Say, everything you hate about the code base, go spend a month fixing and see how fast we can speed run that. That's going to feel really good.

Starting point is 00:00:59 I've been having the most amount of fun in my career over the last three months. Welcome back to How IAI. I'm Claire Vote, product leader and AI obsessive here on a mission to help you build better with these new tools. Today, I am showing how Intercom 2xed, the number of PRs that their R&D department is shipping in just a few months. Brian Scanlan is a senior principal engineer at Intercom and he is going to show us truly all of their secrets to getting a large product and engineer. organization cooking on Claude Code. Let's get to it. This episode is brought to you by Soligo. Every company today wants AI to improve how work gets done. The fastest way is building it directly into everyday business processes, automating employee onboarding, keeping customer

Starting point is 00:01:50 data accurate, managing orders and inventory, or resolving finance and operations issues. When AI lives inside the flow of work, it can update records, trigger approvals, route work, and kick off the next step across systems. That's how teams operationalize AI and deliver measurable results. Soligo makes this possible, and now with Soligo ORA it's never been easier. Soligo ORA gives you access to the entire platform through natural language, connecting your systems and turning intent into action. All of it under your control. Companies like Databricks, PayPal, and Olipop rely on Seligo to run critical business operations at scale. Ready to operationalize AI, visit Saligo.com slash how I.AI.I. That's CELIGO.com slash how I AI. Brian, welcome to how I

Starting point is 00:02:46 A.I. Why I am so thrilled that you agree to join the podcast is I think Intercom has done it, which is you all have met the moment in sort of two ways. One, clearly met the moment from a product perspective, where one of the first companies that had, sorry, I don't want to say legacy business, but had an going concern business that saw AI coming and really transformed how your product worked for customers. And I'm a happy Finn customer. They did not tell me to say that.

Starting point is 00:03:16 And then second, what we're going to talk about is the team at the moment in terms of really understanding AI was going to change how, particular product engineering and design orgs and engineering organizations were going to work. And you just went full speed at changing how the team works. What drove sort of the urgency around meeting the moment? How did that come to be? Was it a single person? Was it everybody? What was your experience? I think in some ways it's been the easiest place to be driving out the adoption of AI in engineering and product, because we've focused the company so much on our folks and product on adopting AI and being AI first in how we think about the product,

Starting point is 00:04:02 future customer support, and all that. And we also had very clear expectations. Like we, you know, we've seen what's possible in the product space. And it's just very clear and obvious to us as, like, connoisseurs of AI. It's like, this is clearly going to be huge in engineering and product and building. And honestly, there's been a lot of impatience for like, why isn't this happening today? You know, if we go back a few years and cursors picking up a bit of business and the models are getting better. But it still wasn't transformative. It still wasn't like the whole business was changed and we're seeing vast amounts of extra productivity. We knew those potential, but it still felt like we needed to have some sort of breakthrough moments or something needed to big had to happen for us to get to the kind of,

Starting point is 00:04:50 huge velocity wins that I think now we're starting to achieve. That said, we still want more. You know, we're proud of where we're asked, but we're not content with what we've achieved so far. I feel like every three months, I have a breakthrough moment. And in fact, I feel like Opus 4-6, I don't know, something just like really inflected in what was possible when that particular model came out. Now I think the GPT-5-4 models are.

Starting point is 00:05:20 also exceptional. And so it was something about that one moment with models that really inflected my own personal use of AI and engineering. Did you all see the same sort of inflection around that model point? Totally. I think you can go back to like November, December, last year, and suddenly you started realizing that you have to think bigger about things or that your imagination is now the barrier, not the tool. You're spending less time massaging the tool to get it to the right place and it's less about auto companies and more about just literally giving us your ideas and seeing what happens. I think the Christmas break happened as well. I remember we pretty much decided for Christmas like, hey, we're going to go all in in Cloud Code because up to that point,

Starting point is 00:06:03 there was a bit of cursory here and there and augmenting different tools. And the Christmas break really helped. I just saw everybody go wild on Twitter X, you know, that people were talking about how this was like they were getting so much done. and they were building all these things. It's come back to work after Christmas break going like, okay, everything's changed. Like, we knew that there was something here and that we were starting to see the signs of us.

Starting point is 00:06:26 But now the whole world is convinced, or at least all of the influencers on Twitter and I like. That would be me. Yeah, I'm actually kind of convinced that companies should increase their PTO and parental leave policies because everybody I know right now in tech that is quote unquote taking time off goes on their vacation and pops open Claude Code and comes back like, 10 times more skilled than they were before their time off.

Starting point is 00:06:52 And so if anybody wants a little minor hack to AI literacy in your org, give people time off to hack. And they will come back with more information than you expected. Okay. I think we're going to skip to the punchline, which I love, which is we're going to see how AI has actually changed how you all ship at intercom. So can you just show us a little bit of how this is changed inside the org? And I think you all are measuring a lot of this. Yeah. So I think we've been diligent as product owners inside of Intercom that we've been trying to get feedback from people and see how they're using the tools and really like just doing everything that we would normally do with a regular product.

Starting point is 00:07:35 And so we've spent a lot of time hooking up cloud code with telemetry, both into things like Honeycomb and data also going into Snowflake where we have our data warehouse. We also store session data in S3, and we mine this stuff for useful insights. And one of the main things that we use to drive adoption of the tool was our CTO, Dara, setting a goal of us to Xing, like doubling the throughput of R&D. And we use pull requests as a crude, simple measure. But, you know, there's, and you can argue back and forth about what's a good measure, what's a bad measure, whether measuring anything's appropriate or whatever.

Starting point is 00:08:19 But I think it's reasonable to just have the expectation that if you can get a lot more done and it's so fast and fun, then why isn't everyone just shipping more stuff? And so it's a basic measure that, like, the tools are being adopted and that they're being used well. And, you know, of course,

Starting point is 00:08:34 we don't tolerate lowering quality and we're high-trust environments, so we don't expect people not to game these stats or whatever. But our metrics and what I'm showing on the screen here is, you know, it's a classic number goes up kind of thing that where we started tracking this back, like how many PRs and what percentage of them were generated by either Claude or cursor or whatever. And yeah, since our major investment in CloudCode the platform and going all in on it and really pushing out like enablement and giving people

Starting point is 00:09:08 freedom to explore and start to build skills and everything, but also pushing them on, on, on, we expect kind of throughput increase, we've seen a big, big increase in the throughput of pull request to our system. And, you know, like last year, like our CI system completely broke. It melted. It, you know, but I mean, it got like 10 times as expensive. And, you know, we did the work. We fixed the bottlenecks. We improved the performance of our CI system. Matt stopped being the bottleneck. And now, Coderview is our bottleneck. But like, we're still, but today we are seeing twice the number of throughput as we did compared to. to nine months ago on our engineering team.

Starting point is 00:09:47 And like, we're very proud at us. And, you know, now it's like, why can't it be 10x? So what I love about this chart just for a moment is I had spent the last two decades of my career in product and engineering, last decade of my career as a CPTO. And it's so funny, I want to go back to a couple of things you said, which is, one, you have to treat your org like a product. And I always thought that my job was not just the product strategy and the capital P. product that we were delivering to customers. It was to design our organization to, I would say,

Starting point is 00:10:20 like, output innovation on demand, which is that was the job. And less romantically, more than less romantically put, my job is to invest R&D for positive enterprise value. That was like fundamentally my job as a CPTO. And so what I love about this is it's merge PRs per R&D head. I'm presuming that includes, does that include product managers and non-engineering R&D or is that purely software engineers? Yeah, this is all of R&D, and it's definitely the case that our designers and product managers and TPMs, like every role in Intercom is really actively using Cloud Code and shipping code and all that. And also, we've been hiring. Like, this number has not been static.

Starting point is 00:11:04 So the number of PRs, the raw number is dramatically hard than just 2X what it was a good while ago. So this is everything from your newest hire to... your product manager who's like adding some copy or shipping like small changes or whatever. You know, that's all based in this number. The other thing I want to call up for folks is every board meeting I have been in for the last three years have said, how are we getting? Well, actually, every board meeting I've ever been, period, has been, how can we get more velocity out of R&D?

Starting point is 00:11:36 Certainly in the last three years, it's been how is AI inflecting our velocity? And it's so funny. I talk to so many people that are like, it doesn't really inflect velocity. We're not actually becoming that more efficient. And I'm like, is that true? Because I look at a chart like this. And I say, this is a little bit more of what my instinct tells me is possible, which is if you go all in, if you prepare your team, if you prepare your code base, if you have, as you said, I think a high trust culture, people are going to look at this and say, oh, they're shipping these smaller PRs or like engineers are gaming the system. I just,

Starting point is 00:12:10 I have not worked at a place that has such kind of like bad culture that, that would actually come as an outcome of setting some sort of ambitious fun target like this. And so I take this at face value. And I think, how is this not happening in your organization? Like literally the physical limits of my ability to type code are unlocked by AI. You should get some inflection there. And so, you know, for VPs of engineering, CTO, even people that are on these R&D teams, look at this and think, you know, this is possible. And it may be a crude measure. but it's, I think, an appropriate one as a leading indicator of what's happening in your org around AI. Yeah, and we support this with not just telling people to move faster.

Starting point is 00:12:56 Like, that's, you know, we're really looking from first principles of how to, how to do the work. Like, we believe that like all technical work will become agent first. And I'd like to set like a deadline for that, that, you know, at the end of the month, we're just going to go all in. And it's never going to be the first thing that happens, say, in response to an alarm or in a planning meeting, that there isn't like an agent in there kind of doing the basic work. And I think that's a realistic expectation. But it involves not just, we're not just moving faster for the sake of it. We're seeing that we're moving faster by looking at the fundamentals of where we're spending our time and reimagining how that work could be done in an agentic world. And honestly, if like the, if the agents didn't get better,

Starting point is 00:13:42 if the models didn't get better, the harnesses didn't get better, we've got the building blocks just today to be able to just continue going, moving around, looking at how we do our technical work today. By technical work, I mean everything in delivery of product and move it to entirely be agents first and allow us to move up to a higher level, to be able to work on higher level concerns

Starting point is 00:14:06 or just getting more stuff built, more stuff out there, or higher quality. That's all within every organist grasp today. But you have to be very open to change. And I guess what's been fortunate in to come over the last while is that we have been extremely open for change, both in the product side of things and adapting the company to how, I think,

Starting point is 00:14:24 companies need to work now with AI. And we're starting to see results. Yeah, the other reflection I have upon looking at this chart is we're recording this in kind of the spring of 26. And Anthropic just said that they crossed 30 billion in revenue. I think up from 19 a couple months ago. And I suspect their revenue chart looks a little bit like your merge PRs per R&D chart. So how are you all thinking about the tradeoff on cost here, right?

Starting point is 00:14:54 Like we're all consuming Claude tokens. Yes, you know, efficiency or output is going up, throughput's going up. But is cost scaling proportionately? Are you all worried about this? Is that the problem right now? Are you even worried about it? How do you think about that? Yeah, we're definitely worried in that.

Starting point is 00:15:11 the build looks exactly like this. And, you know, I spent a lot of my career worrying about AWS costs and worrying about our margins and stuff. And then suddenly you've got these costs showing up and they're disproportionate to the growth that we've seen anywhere before. It's like hiring whole new offices of people. And but at the moment our attitude has been, look, everyone just turn on an opus for everything.

Starting point is 00:15:38 One million with context window. So, you know, we just use the API plan, so it's all just on demand. And we think that there's enough alpha or benefit in, at this point, going as fast as possible and caring about the bill later because of the later benefits will get. Maybe that's a position of where income is. I don't think it's realistic or feasible for absolutely every single business to do it. And honestly, I do kind of respect when you have to actually think about your token use and how that can kind of force you to be more considerate,

Starting point is 00:16:13 or it sometimes even gets you better results. You know, you don't need Ocus for everything. There's faster models out there. And so we're just kind of avoiding that optimization phase until the point of where we, until, you know, we've gotten serious benefits from investments in this platform. And so I think this investment, and I think we are treating us like an investment at this point,

Starting point is 00:16:35 is worthwhile. But, you know, if this keeps going, at this race. Yeah, we should all work for Anthropic, you know. I think the way they're hiring, we're all going to end up working for Anthropics. So, okay, and then one other thing, because I think, you know, folks are going to look at this, certainly engineers and they're going, okay, like you're shipping more PRs, but it's all sloped. It's all garbage. You know, I know you all are measuring quality on the outside of this, on the other side of shipping all this stuff. So how have you seen this inflect your measurements

Starting point is 00:17:05 around quality or customer value or what you're trying to achieve at the end? and not just lines of code. Yeah, I have a stand-alone graph that I can share, which is kind of interesting. And so we've started to look at the time it takes from the first line of code written in a feature to the time it gets posted on our news channel, like our updates. And that has decreased consistently over the last few months. And we're not optimizing for this, but we're interested in it. And the other thing is, like, the sheer volume of things we have shift also appears to have kind of just rapidly increased in the last few months as well.

Starting point is 00:17:46 And that should be a bit of a trailing metric. So we believe that these numbers, these like increase in volume is being borne out in real features, real products that our customers are using. And even we've been running some experiments like how far can one person get on their own building something that's plausibly a whole entire product area. feature to be able to sell. So this is something we're taking seriously. And we also care a lot about quality. We've been working with a research group in Stanford. We've been giving them our data and, you know, mostly just looking for any kind of insights to make sure we're not blind. You know, I join absolutely every single incident. I'm an ambulance chaser and I make like, and I'm not seeing any increase in kind of regular kind of incidents or editors or customer facing problems.

Starting point is 00:18:38 We've had a few kind of weird problems, but not related to production. And but also the interesting thing from the Stanford data when we checked back in on it last week was that their measures of code quality reckons that the code quality was improving. And, you know, the models are improving. The ages are improving. We're adding more and more guidance and skills and all these kind of things, which I think do craft or do force people down the road, which should result in higher quality output. it, but it's great to see when tools can independently pull that out. Now, devils and details, you've got to go into the weeds. You've got to actually really have a strong sense for what quality means in your own environment.

Starting point is 00:19:18 But, you know, we're not seeing some of the things that people are worried about out there. But that's it. We've got a mature environment. We're 15-year-old. SaaS company, we've been doing this for years. You know, AI and speeding up your velocity will magnify all of your strengths and weaknesses. And thankfully, I think we've got a lot of. of strengths on the software to the re-side of things that we've been able to take advantage of.

Starting point is 00:19:40 One thing that I want to kind of call out here, which is you said that you've seen your code quality increase, which again, intuitively, I've always believed to be the ultimate endgame of this. And every engineer, not every, many engineers that I've talked to and just don't believe it to be true. But when you have the capacity to take on tech debt, when you have the capacity to take on the dragons in your code base, you actually can do those things, whether it's developer experience, security and compliance, just general maintainability of your code base, flaky test, improving your CICD, all those things become very tractable, not just technically, not just can an engineer execute on it, but actually the business, and I feel like people don't appreciate this,

Starting point is 00:20:25 the business capital T, capital B, only has so much capacity for internal projects, meaning we can only allocate so much of R&D towards improving code quality, just how we live. We don't generate ARR on code quality, unfortunately. But when the cost of doing that compresses, then you're able to say, yes, as a business, we should invest there, one, because we can. and two, because it'll unlock velocity on the outside for our agents and for our product managers and for engineers. And so I think this is actually a really important moment for folks to invest in code quality. And I often advise a lot of CTOs and VPs of engineering when figuring out how to get their engineering team AI-pilled, say,

Starting point is 00:21:16 everything you hate about the code base, go spend a month fixing and see how fast we can speed run that. That's going to feel really good. Okay, we've chit-chatted, we've shown graphs. The point of how AI is to actually ship some code. So let's switch over to that. We can probably come back to all these topics. I think they're so interesting. But you're going to show us how you all, again, in your mature code-based, mature organization,

Starting point is 00:21:39 are actually getting things live and some stuff you've done in the repo to make that possible. Yeah, sure. So I'm going to do a pretty trivial change in our majestic Rubion Males monolith. So this is... I love it. millions of lines of code, all the tests. Yeah, the code base is older than Intercom. It was created before Intercom was incorporated.

Starting point is 00:22:01 And, you know, it's got its problems, but we love it, and we tend to it. And so I'm just going to do a relatively simple change of adding a lobster emoji, Rails redirect, to chat p.r.D. AI. So, also, I try and give hints to Claude when I'm actually demoing something. I don't know if it actually helps, but it makes me feel better. I'm just trying to add a bit of urgency here, you know. I think that's everybody's prompting strategy, which is, I don't know if it helps, but it makes me feel better.

Starting point is 00:22:36 Totally. That's a nice way to interact with the agents, you know? And so what we're seeing here is, I mean, it's already kind of figured out. where to put a redirect. It's got the nice lobster emoji. And is asking me if I want to open IPR. So obviously I do. And I think it's actually gotten the URL run.

Starting point is 00:22:58 It's at Intercom.com, which will have the URL, but we can tell cloud codes later on about that. So what we're seeing here is, first of all, an important point. I'm just going to scroll back up. One of the things we noticed early on when we started getting cloud code to write all of our code. And, you know, we're up well about 90%

Starting point is 00:23:16 is that it would create pull request descriptions that were kind of terrible. It would describe the code. And that's the least interesting part of a pull request. You actually, as a human or even as an agent reviewing code, you want to know the intent behind the pull request. You want to know the interesting bits, what's kind of related to this. And, you know, LMs are very good. That's just regurgitating or rewriting code into English.

Starting point is 00:23:42 That's fine, but it's not what we need. And so one of the things, and we noticed as well, when people were using Cloud Code, we created an LLM judge to evaluate, because we had suspicions that the quality of the pull request descriptions was going downhill. So we created an L&M judge to evaluate what does a good pull request. We decided what a good pull request description should look like and then got an LLM judge to go through all, like, once and months of data. And yeah, the trend was awful. the trend was going in one direction. And this is bad. And, you know, look, humans aren't perfect at creating pull request descriptions.

Starting point is 00:24:21 Sometimes they're just blank and whatever. But I think with our use of tools like cloud code and setting up these kind of platforms around it, you really have to be pushing for like higher standards. You want as close to perfection as possible. And this was clearly something that we're just not going to tolerate a lowering of standards in our environment. So we created a skill.

Starting point is 00:24:42 call CreatePOR. And what it does is it uses whatever context it can from the session to describe the pull request. So it's not quite rocket science. But often the session knows exactly why it's doing the thing. And so, but then we had to kind of force it in. You know, we started, we told people like, oh, just use the CreatePOR scale. And then people would want to use it. You don't really actually want to be how people remembering things. So we added it as a hook. So if Claude decides to you know, use the GitHub, CLI to open a pull request. We just block it and we say, yeah, tough. You need to use the create PR skill.

Starting point is 00:25:22 And also you're probably going to have to figure out a different text description. And then I might interview you if it's not enough context there. Hopefully there's enough context in this. But the point being that, you know, this is a platform. We want great outcomes. And we measure the inputs and outputs. And after we put this in place, the LRM judge

Starting point is 00:25:43 reckon we're doing a great job now. And so we're at higher quality pull request descriptions now. This is not the most important thing in the world. This is not going to get intercom to 2x or to 10x revenue or anything like that. But it's all of the composite little jobs that when you assemble means

Starting point is 00:26:03 you have an extremely competent engineer who works appropriately in our environment. And that's where we're putting our investments for each little skill and hook to do these things. So they almost looking consequential, but, you know, they result in better outcomes. And so we look through here, it's creating the PR. I'm going to have to check on what it's going. This probably will be automatically approved as well, which is pretty cool.

Starting point is 00:26:26 And we might even see some pull request feedback as well in action. It's still building. We'll come back to it in a couple of minutes. One thing I want to call out for folks, as you were describing sort of why you put in this skill to improve the PR and for those, who don't know. A skill is basically just like a set of instructions and sometimes scripts that a LLM or a agent harness can invoke at a certain step in your flow. One of the things that I was thinking as you were describing why you put this skill together and got really opinionated about PR descriptions is in engineering, we have been able to architect really opinionated CICD pipelines.

Starting point is 00:27:06 So how written code goes from being written to deployed in production. And we have, I mean, you saw it in GitHub, we have all these checks and lints and pre-deploy, you know, pre-flight things and preview branches, all these things once the code is written. What I think is really interesting about skills is you can bring some of that determinism to as you write the code, how you want that process to go. And we used to not be able to do it because it used to flow through the hearts and minds and hands. of humans, which are much harder to put in these structured guardrails. And we would do this by writing wikis or having, you know, SOPs where it said, can you please follow step A, B, C, D, C, D.E. And now you can just make it really easy to enforce those standards across the team, which I don't think is micromanaging. It's actually just making everybody's golden path much smoother to production. And so I

Starting point is 00:28:01 think there's this just very interesting parallel to how we've approached CICD to how we approach things more upstream, even from the product management perspective. Totally. We're on this movement towards a software factory. And what factories are great at is, you know, like an IKEA factory or something. It's all the same furniture, all the different bits, and you know how to assemble it. And look, it's not your artisan stuff. It's not, or it's not cutting edge or whatever. But it's very predictable and, you know, has a certain quality and meets certain standards when it comes out the other side of the factory. And so, well, pull request descriptions, again, they're not, they're not make or break for the factory or the pull request or whatever. It's one of those

Starting point is 00:28:44 qualities of just good quality work that's reliable, predictable, and then when to assemble together, you've got your IKEA factory. Well, and people don't want to feel, certainly engineers don't want to feel like they're part of a slot factory, right? And so these things that you can add into the flow that actually up level and meet the standards of the engineering team really help your human engineers on the team feel like they're working in a place that values quality. And so I appreciate that you've put that effort into these behind the scenes hooks and skills because I'm sure it reinforces to a culture that's being asked to move very fast to ship how you know, ship things differently than they have before that you still do care about their experience.

Starting point is 00:29:28 reading, pull, you know, pull request descriptions, they're, you meet their bar for quality. And I just think it makes everybody happier. Yeah. Well, it's great when the robots just produce the work that you'd expect of her best engineers, you know? Yeah. And, you know, maybe as you get this live, I also think there are just still such

Starting point is 00:29:50 more interesting problems to solve in software engineering. And we can talk a little bit later in the episode about some of the interesting problems that you all are solving on the product side, on the technical side. I think there is no lack of hard intellectually stimulating creative problems to solve for customers. And coding redirects is just 100% not one of them. So did we get my redirect live or are we close? It's still there. I'm waiting for an automatic review to kick in. But we can come back to us. So one of the things I would like to show next might be some of the telemetry that we have. in place. So we saw that, you know, there was different skills getting invoked and, and we don't

Starting point is 00:30:34 like flying blind. To run a system like this, you need to know how well people are using us. Are people using these skills at all? You know, the kind of basic information that you'd expect of, like, when you ship a product to your customers, like, you know, where can I see the usage? How can I fight for the usage? What's going wrong or what's not going wrong? And so we collect a bunch of telemetry using different mechanisms and have different homes for us. The most open one that we have is we collect basic usage information for skills and the like

Starting point is 00:31:07 and we send it to Honeycom. So we just have a shared key that's deployed to all of our laptops. And anyone can go in and kind of look through this data. So if you're developing a skill internally in Intercom and like hundreds of people do this, it's very easy for you to go into discover like, hey, who's actually using this?

Starting point is 00:31:24 when are they using us? And you've kind of used this as a kickoff to like follow up on just like basic discovery of usage of your skills and all. And like unsurprisingly, the kind of main skills that we have are things like creating PORs, admin tools is our admin like internal tooling APIs or where we have an MCP in front of it, build Kaiser CI system, Snowflake Log is where we put Snowflake. So you can see from this like a lot of work, a lot of the skills that are being invoked. We're all around the building and then seeing where my stuff.

Starting point is 00:31:54 is and maybe some troubleshooting type information as well. And so this is the first kind of step. It's like you don't have this. It's hard to have a large system like all these hundreds of skills and hundreds of creators working in this area without having decent telemetry. The next thing we do as well is we also collect all of the session data and put it into S3. And so we anonymize it.

Starting point is 00:32:17 We do a few things to make sure we're not doing anything too private. You know, people put all sorts of stuff in their sessions. They yell at their sessions. Yeah. Yeah, people have personal relationships at times with Claude. And like, we don't really want to know about that, but we do want to be able to dive deeper into how things are going. You know, I think understanding like what the dropout rate of sessions,

Starting point is 00:32:40 like, dude, how quickly people got to something useful, like whether it was a P.R. or something like that, this kind of information is pretty interesting. And so we're harvesting a lot of session data and we're doing different things. This is what I'm showing here on the screen is like a very simple tool that we put together, which just gives you some personalized insights. And you know, you can do this inside Claude these days as well. There's plenty of skills out there on GitHub where you can do session analysis.

Starting point is 00:33:08 But I think we just built a little tool on top of our session collection system to give people feedback. And it's feedback that we're interested in giving feedback about how their sessions are going and how they're kind of fitting in, how you should think about your own, I guess, use of Cloud Code compared to everybody else in the org. And, you know, I'm not doing too bad here. It's like 79% percentile. You know, someone has to be down the bottom of every percentiles. And there's some interesting feedback here.

Starting point is 00:33:36 Like, it's kind of getting annoyed at me. Oron, I was getting annoyed at Cloud a few weeks ago because I'd set up Gog to interact with all of our Google stuff internally. But it kept on trying to. to do the wrong thing and I was kind of getting a hit to it and ended up adding stuff to Claude and MD and stuff. It's kind of giving out to me here or it's reminding me that this wasn't a very effective way to interact with Cloud Curr. So, you know, it's a good prompt for me to actually go and fix up my memory or whatever. And like we all like people are at different levels,

Starting point is 00:34:12 even at intercom, people are different levels of adoption. People are joining Intercom. They may not have seen a system like this before and they want to know how things are going. and get feedback. And so this is one example of how we're just trying to pull together this information to give useful, actionable insights to people so that they feel supported and that we're not just throwing them an API key and saying best of look. It's like, no, we understand what growth looks like and the progression that people go through when they're using these tooling and getting better and kind of self-improving and we want to

Starting point is 00:34:44 support all that. So this is one of the things that we're doing with the session data. There's loads of other things that's work in progress. like being able to, like we want to get insights to which skills are the highest quality, which skills get you to your results as quickly as possible, and then which ones need work. You know, which ones aren't working out so well and might need a bit of attention to improve. This episode is brought to you by Cursor. If you all have been watching how I A.I., you already know this.

Starting point is 00:35:14 Cursor is my favorite way to code with AI, whether I'm using plan mode to build out an ambitious feature, reviewing AI-generated diffs right in my editor or kicking off cloud agents to multi-thread our roadmap, I reach for Cursor as my favorite multi-model coding platform. Even better than building myself in Cursor, I love collaborating with BugBot to fix PRs for code security and quality and have begun relying on Cursor's automated agents to keep our code base clean. It's not just me. The most ambitious teams love Cursor 2, including engineers at Stripe, OpenAI, and FickM. Ready to build more, we're giving $50 in cursor credit to how IAI listeners.

Starting point is 00:35:57 Claim your credits at chatprd.aI.ai slash how IAI. That's $50 in cursor credits by going to chatprd.aI.i. I have to pause before we look at your list of skills because I'm so excited about that part. But if folks aren't watching, they may have missed how amazing what you just showed is. So I'm going to reiterate it, which is, one, you've instrumented all your internal skills with telemetry so that in you're using honeycomb. Love the honeycomb team. You're using honeycomb to see how often those skills are invoked over time. So this is just a tip for anybody building out a skills repository internally or even somebody who is maybe trying to get some visibility into their impact across the org.

Starting point is 00:36:47 let's say you build a skill and you want to go to your boss and be like, boss, my skill is being used by literally everybody every day. Find a way to put event level telemetry invoked in the skill, a little dashboard, and you can track those over time. Again, treating your org like a product, treating your repo like a product, treating your AI setup as a team like a product, and all products, all good products have tracking plans. And so figuring out how you put that telemetry in, I think is really smart. And then the second thing for those that missed it or how to do it is you're taking all the raw session. I'm presuming JSON files. So for folks that don't know, ClaudeCode stores all your chats with Claude code on your computer in JSON. And you can go look at those or query those in any time. It sounds like you all are uploading those files to S3 and then layering on top of it some anonymization, some user level views. And then, you're essentially building sort of what I would call like an internal eval of how people are using Claude Code and what problems they are having over time so that individuals, one,

Starting point is 00:37:59 can triage their own implementation. As you said, oh, it looks like I need to do this or that or improve my agents MD. But then if you're seeing consistent themes over the organization on it's never invoking this MCP when we need it to invoke this MCP or people are yelling. no, every time the create PR skill gets queued up, you can fix that at a systems level. But you can't do that if you don't have the visibility. So again, my VPs of engineering, my CTOs, my friends out there, put some telemetry in your skills, and then do some meta-analysis on your clodcode sessions across the org, and you'll be able to identify places where some probably have some high leverage fixes

Starting point is 00:38:40 are going to get your team unblocked over time. I do hope and expect that this stuff will get easier over time. I'm happy to kind of invest the work so that we can move fast and kind of beyond the bleeding edge. But there's something to be said also for having like last mover advantage. And just getting all this stuff for free whenever Anthropic ship is or whoever shit. I mean, maybe this is a product just that people should buy or build. But for us right now, we've no choice. We just got to build it.

Starting point is 00:39:10 We're like we're fascinated with the insights that are locked away in these. sessions. And so we just got to build stuff so that we can see what's going on. I love it. Okay, can we see some of these skills? Yes. So it's a very exciting GitHub repo. Our lives are all GitHub repos and markdown files. Totally. And we have we have a lot of activity at the moment. We ran an AI day last week, kind of getting more people contributing to it. And so, well, So what this is, it's a plugin repo, and we have a series of plugins, and they're growing daily at the moment. Kind of every team will have their own kind of specific plugins. In general, though, we're very liberal.

Starting point is 00:40:00 We want stuff to end up in here, even if it's not great. But we do sweat the details on the core plugins, things that we think are fundamentals, foundational ones that go out to everybody. And so where we start off was we have like these base plugin, which gets installed. Oh yeah, so we distribute this not via the Claude Code plugin mechanism. We found it was just a bit flaky. It was, you know, sometimes it updates, sometimes it wouldn't. And it ended up kind of like trying to manage a Python install on hundreds of different laptops.

Starting point is 00:40:33 You know, it's, you just don't want to do it. And so we ended up using our internal IT systems to synchronize all of the plugins to the disk. of everyone's laptops. So this is a great cheat code. And, yeah, strongly I recommend getting very close with your IT team to be able to deliver things like this reliably and not have to rely entirely on the Claude Code plugins mechanism.

Starting point is 00:40:57 Just our experience is a bit flaky. And it just gives us a lot of reassurance. We don't have to do certain types of debugging once it's all on disk. So we know this stuff works anywhere because we've got our IT team pushing it out to disk. And so we got some safety hooks. We have some of the foundational things like merging PORs. We don't want our agents going off into AWS.

Starting point is 00:41:20 And then just different settings and the telemetry things as well. So these are the core things that absolutely everybody gets. But these are minimalists. We don't want anything that could be inappropriate in, say, a non-technical person's laptop or whatever. So this is like the basic building block. The next main bit for us is what we call developer tools. Again, this would be things that we then do all of engineering and beyond at this point. And these would be generally skills that would be appropriate to be used by any engineer in the course of their work day to day.

Starting point is 00:41:57 And again, we would have a high quality bar again for all of these. These would all require evals. These would all require to pass different kind of tests or analysis that we do on the quality of skills. And so we try and maintain these and make sure that they're well updated and well used and we pay a lot of attention to too. I can maybe go through one of these skills in a bit of detail. This one's near and dear to my heart. It's flaky specs. And I think the interesting part here is not the skill itself.

Starting point is 00:42:28 The skill does reliably fix flaky specs. And I can pull up in the meantime, like here is a list of flaky specs that we have at the moment. I'm going to open up the skill and just start to run it on this issue. And so while this is running, just walk through what's in the Flaky Speck skill. And so there's a checklist here. And the fun part about how I built this was not that I was a world-class expert of fixing flaky specs. I roughly know the problem and have fixed a few of them in my time. But there's different class of a large test environment like ours.

Starting point is 00:43:06 We have hundreds of thousands of tests. And if you're not super careful about the data poisoning or race conditions and all these kind of things that can kind of kick in when you're running millions and millions of tests a day, you know, you end up with these tests that end up slowing down your ability to deliver code to production fast and reliably and not confused developers by things randomly breaking. And there's kind of known patterns and known ways you would go about this. I knew my goal, which was to have a skill fixing all of these flaky specs. And it was something that agents are pretty good at when you give them a kind of testable goal. You know, this wasn't quite open-ended. And I also had this huge backlog or, yeah, there was a backlog of probably a few hundred. But then also all of this historical flaky spec information.

Starting point is 00:43:50 And so you can just harvest all of this data in your environment to go, hey, Claude, I'm going to build a skill. First of all, go and research every single flaky spec we've ever had. And then we're going to build a checklist. We're going to build a mechanism. And then we're just going to crunch through them over and over and over. get to this like one X kind of, you know, it's doing a good job, probably as good as job as I would do. But then as you keep building up all of these like little teeny steps, which are the kind of things that, you know, our best rails coders kind of do, they've got all the stuff in their

Starting point is 00:44:20 heads and all the different classifications of flaky specs and, you know, verifying it gets real data and, and then, but the really fun part is then you get, so you get something that's starting to be like 10x. It's fixing flaky specs that, I'm not. I'm not even sure if I could do. It might take me a day or something. And I probably wouldn't do it. But then you start to add in stuff into the skill along lines of like, okay, when you fix something and it's novel, you need to update yourself as well.

Starting point is 00:44:50 So in that session, it's updating the skill. So the skill itself is kind of learning as it goes along. And we also fan out. So it's like, okay, I'm very happy that you fixed that flaky spec. Now find every flaky spec that got impacted by that nature of it. And so I. I went from zero to like 100x in terms of this skill now is like, you know, see your distinguished engineer that role or being able to fix these specs. But it was more like the process that got there.

Starting point is 00:45:21 And so like working with a feedback loop, working with like a very clear goal. And then giving it the freedom to do it, you know, giving access to the systems where it needed to pull in metadata, been able to run bills itself. And having that feedback loop where it's learning and And then, you know, designing the skill as well so that it's, you have to edit it as every swath. It ends up taking up to much information that might confuse things. But then you break things out into like reference guides. So you're doing this like progressive discovery thing. And I've even accidentally pointed this skill as like a Python code base.

Starting point is 00:45:53 And Claude has just gone, like, it's just Python. I'd be able to go. And it kind of uses the knowledge that's applicable to us. And so again, this skill is not going to make, Intercoms, revenue go 100x. But it's now this like perfectly reliable thing that we really no longer

Starting point is 00:46:13 have to think about. Now we can expand out into many, many different areas. And we just have to maintain this. And the maintenance work for a skill like this just isn't much. And we have evals and stuff so that when we're upgrading models or maybe even move into cheaper models or whatever that we can make sure, yeah, this thing isn't

Starting point is 00:46:29 progressing. It's still working as well as we think it is. And we've got confidence and certainty that this is still a reliable building. block and again the constituent part to put when put together you've got like a very senior engineer who's able to get any work done in your environment and so yeah we can take a look at what it's doing oh it's asking me for permissions um should have you forgot to date make no mistakes dangerously skip permissions that's the rule on how i ai one thing while running i wanted to say is you know this skill is a perfect example of what i call the like and then AI workflow which is i

Starting point is 00:47:05 everybody like pull your skills and pull your workflows through a bunch of amends so i want to fix flaky flaky tests so i go to gethub i find a flaky test i run through the skit let's say you fix it and then what would you do well i would document how i fixed it and then what would you do well i would go find all the other ones that are just like this and fix that and then what would you do i would go from you know our rails code base to our python code base and apply the same you can just do that over and over. And because the cost of running these is so low, you can actually pull the thread of a bunch of stuff any reasonable human would have quit at step one because you're not limited, again, by headcount or coordination costs. You're limited by the technical capacity to

Starting point is 00:47:51 solve the problem, which I think is a really interesting way to think about how you get from like the, you know, engineering intern that whose job is to go through and take a first, you know, gentle pass at all these flaky tests through to the distinguished engineer who has just speed run through 300 of them and has thought of a completely different way to architect your testing overall in your repo. So I think that's a really great model for things. And then the other thing is like, again, engineers, go speed run your tech debt, fix your flaky tech. Like these are all things that as somebody who has run engineering organizations, I have heard over and, over. We can't because our code base, blah, blah, blah, blah, blah, blah, like, can we pretty

Starting point is 00:48:38 please allocate this amount of time to just fixing this really annoying front end flaky test? Like, you don't have to ask permission for that stuff anymore because there's just a new way to solve it. And I think, again, just going back to some of the stuff we were talking about earlier, I think your overall product quality is going to go up. I think your overall developer experience is going up. There's just so many good things that come out of using these tools and using them correctly. Yeah, I think backlog zero is a realistic thing for teams to be able to go after. You know, all the things that you wish you would ever wanted to do, you know, it's now just achievable. But of course, you've got to balance it with, you know, all of the extra stuff that you can just

Starting point is 00:49:18 deliver at the same time. But it's so sweet to be able to think that, hey, we actually have a path to getting rid of all of our backlogs and all of the kind of architecture changes or whatever. you know, we can, recently I was taking a Go microservice and re-implementation it and Ruby. I was a single cloud code session. Before November, this was something that I would have had to advocate for on a roadmap and like, you know, plant some seeds and different engineers, heads and kind of know people towards it and kind of blame a lot of problems on the existence of his microchapter. Wait, trigger warning first before you talk about that process.

Starting point is 00:49:57 Sorry, I'm giving the secret sauce here of how to influence an org. But now it's like, well, I don't even have to think about this now. It's a single session. And in fact, I can get Cloud to implement it five times and compare the styles or compare the, you know, get us to review them and figure out what the best way of implementing the thing is. And this is just like this level of kind of creativity and freedom that where like your imagination is the blocker, not the time it takes to actually knock out one of these things, which was months in the past, you know? I completely agree. And I feel this at chat parity where people are like, what are you, I mean, I'm a product tool for product people.

Starting point is 00:50:36 They're always asking what my roadmap is. I literally don't have a roadmap. We burn down the roadmap every week. And then we figure out what we're going to ship next. And of course, we have thematic ideas we want to pursue and things that are larger. And one of the things that I do to keep myself from overshipping absent product market fit is literally constrain the ideas to what I can do in my brain, which is there's like a natural throttle on not getting slop out because it's not engineering broadling me. It's actually just good commercializable ideas. And I think that's where we're going to see some of the limits start to come and play. Again, referring to Anthropic, another big news piece came out.

Starting point is 00:51:20 that they're hiring a bunch of PMs because they have so much engineering capacity. They're actually limited at the PM capacity. And so it'll be interesting to see where the bottlenecks in your business, you know, end up. And which bottlenecks are appropriate. It's probably good to have a product bottleneck a little bit because then you're not shipping anything which customers can absorb. And so I just, I think it's going to, and it's going to evolve over time. And then, you know, product is going to have a whole set of skills.

Starting point is 00:51:49 And then I don't know. what we're going to do with our time, hang out on the beach. But I think it's a pretty interesting time to run orgs. Yeah, you know, I think engineers, designers, product managers, maybe it's just all going to be one blob of builders or something like that. And everyone, everyone just does things. Everyone just does things. Yeah.

Starting point is 00:52:11 And, you know, it's great. It's lowering the barriers to like just getting a lot of stuff done. And it's like so much fun when you can. When you don't have to ask somebody or get something on a backlog or whatever, you can just get it done yourself or even just get it done very fast in a small group. It doesn't matter what your discipline is. It's just like a great leveler at the moment. So yeah, so we're live.

Starting point is 00:52:33 I think our lobster is live. And it should be on app.com lobster emoji. Look at that. It was amazing. I need to get you all an affiliate code, you know. Yeah, I mean, lobster emojis, they're the new thing. They're the new growth hack. are the new growth hag.

Starting point is 00:52:51 Okay, so we have seen your PR per R&D employee go up. We've seen how you can get from kind of Claudeco to production very, very fast with a bunch of guardrails. We've seen your list of it looks like hundreds of skills, but at least dozens of skills that you're invoking via hooks. You're using that to not only ship customer-facing product, but you're also using that just to make developer experience better, burn down tech debt. all those things we want to see. You all are, you're measuring it both from a telemetry perspective, both like quantitative and qualitatively. You're measuring your cloud code sessions. And, you know, 2x isn't enough. You're going to get to 10x. So you all are on the edge, at least for for folks that I talk to, and I'm sure you're like me where you're like, sure, you think we're on the edge,

Starting point is 00:53:42 but then I see people and they're really on the edge. So we always have ambitions to move forward. But But my question now to you is, how has this impacted how you think about your customer's product? You know, I'm an intercom customer. I'm a thin customer. I interact with intercom code and intercom UI literally every day. My open claw has an intercom API key. How do you think about, you know, now that you have this experience with Claude Code internally, how do you think about what that customer experience is going to look like?

Starting point is 00:54:12 Yeah, there's a few things going on. One is fast people are outsourcing a lot. of decisions to their agents. And this is a good thing in many cases, but, you know, there was good research done recently about what does cloud code pick? And certainly I've had the experience in the distant past where I'd ask an agent to add something except do behind a feature flag. And then it would start to go and implement its own feature flag system. No, no, no. In our code base, which has a pretty sophisticated, old school home-rolled feature flag system. So, you know, nowadays, mostly will stick to whatever is in the codebase and that's fine.

Starting point is 00:54:53 But, you know, SaaS products, they're really good at their jobs. They're actually worth paying money for. And getting back to the feature flag situation, you know, if you're building a new business, you're relying on your agent to make decisions. Often an agent will, when prompted, it's like, hey, how should I solve a feature flag problem? I want to make sure I'm doing all these safe deployes and that. The agent will just go, yeah, I'll do it myself. And the kind of build over by decision. And you can see why the agents do it this way, because they can achieve this. They can get it done.

Starting point is 00:55:29 They don't have to rely on the human. Okay, like open claw changes things here a little bit and maybe computer use does as well. But still, we're not, we haven't really adopted SaaS businesses to be agent-friendly. And that means, well, all sorts of things around. and how do we position our websites and content and how do you get updated in their knowledge and how do they discover it? But also, can they actually just get it done?

Starting point is 00:55:55 Like, can you ask an agent, hey, could you just sign me up to Intercom and get me in working on my website? And so, like, this goes alongside just having to make more APIs for things. I think I'm kind of like Omni Channel as such. I think like there's a feature for CLIs and MCP and like Rest APIs.

Starting point is 00:56:17 I think I'd like us to get more comfortable around things like ephemeral APIs or multi-step APIs. I think CLIs are good at wrapping these kind of things. But the whole point of all this, where I'm getting at is like, you know, you want to be able to just help agents out at the time when they're interacting, they're in discovery mode, and you want to give them clues, you want to give them hints, you want to give them help to be able to do things like sign up for something fully without having to go back to the user and say, yeah, sorry, can't help you there.

Starting point is 00:56:43 you've got to go away and like figure out how to sign up for something. So I've been working on something in the last few weeks, which hopefully you should solve a problem. I can paste in a prompt and then see how far it gets. I also just while we're running this, I have to go back to your feature flag example because it, you know where I used to work. It broke my heart that build it yourself was at the top of the feature flagging list.

Starting point is 00:57:10 But I do think I have a paranoia. moment about this, which is model providers and harness providers are highly incentivized to build it yourself consumes lots of tokens versus buy it, maybe consumes less. So I'm just really interesting to see how this all shakes out. You know, people, people are very anti-SAS is dead. And I'm a little bit more like, yeah, but like the current form factor of SaaS really is, has something coming for it in a particular dev tools because these models are so good at writing code. I think you're in a real pickle to try to figure out how to find the right value wedge at the right moment, how you can allow agents to not just sign up and set up things, but purchase it, you know, like what does your

Starting point is 00:58:06 trial experience look like if your first user is an agent. I think all of that is super important. And then, you know, to your point earlier where you said, you know, are we APIs, ephemeral APIs, CLIs, MCP? I think the answer is yes right now, which is you cannot predict the medium by which a user is going to come to your site. They could come through a search and hit your website and download things and look through your docs. They could come through cloud code. They could come through an open clot, you just really don't know. And so you sort of have to meet your customers and your non-human customers where they're at. And I think it's really smart for teams that have any part of their product that needs to be implemented via code to be thinking about this problem yesterday,

Starting point is 00:58:54 because you will be left behind, I think, if your agent experience isn't there. Yeah, agree entirely. And I think there's a whole craft in how to make, say, a CLI, like, agent-friendly. I think like MCPs obviously get that right a lot of the time. But, you know, for example, one of the things that we do and the help is, like, kind of just give a hint to the agent. It's almost like prompt injection to a certain extent, except it's not malicious. You're just trying to get it along to what is trying to achieve.

Starting point is 00:59:23 It's like, well, maybe you could check email. And if an agent has access to your email. That's what I was looking at. Yeah. So it's just they're going, you know, I can probably get this done. Or like you can hint to them like, I've kind of cheated with this. So this is my own personal website

Starting point is 00:59:37 hosted in Rasell. And it is, I've kind of pre-populated a few articles so they can upload and Finn has some contents to answer questions with. But you can also just, you know, return in the help going like, hey, you know, you should probably think about creating some articles

Starting point is 00:59:54 if you want Finn to actually start answering questions. And that can be an extracted, from, you know, to codebase or whatever. Yeah, being like, I've been also think like a lot of interfaces, like CLI interfaces, like I use Gog. You know, it's part of the open claw universe. And I think it's a lot better than the official Google DWS one. And but I think if you start to use it, it's actually just more human.

Starting point is 01:00:22 As in it's the interface just kind of makes more sense to a human. And I think the Google one is like, I kind of get what they're getting at and there's kind of Jason in there and stuff like that. It's not that, but it feels more human friendly or something, things that are effective for agents can often be things that are more human friendly because they're discoverable, these verbs and words and not just kind of inscrutable weird stuff going on in command line options. I think I've confused Claude here. I'm not sure what, where is this? That's okay. I'm going to, I'm going to narrate for folks what's happening here, which is you basically said like install intercom. on this site, there's an intercom cly that's like, cool, I can access the intercom APIs and do a lot of this.

Starting point is 01:01:06 My favorite part of it, though, is signing up, getting a verification email in your email address, invoking via like this hint, basically, of like if the user has email access set up in however you're accessing it, go check for this verification email because we have a code in there that we got a snag. And because you're using Gog, which is a command line tool to access Google workspace, you can go do that, pull that code in. And what I think is so interesting about that particular flow is, you know, I think AI is creating sort of race conditions in shipping across the org, which is like you can yolo a CLI probably faster than whatever team that manages email authentication can change how email verification works. And so you're like, I'm not going to let that

Starting point is 01:01:57 break my product. What I'm going to do is create a flow that I can use that sort of sticky part in the flow, AI brains, and get through it. And so again, your product doesn't have to be perfect for an agent to traverse it. And this is one of the things I'm actually really excited about SaaS is all those things that are just so complicated to do as a human, multi- step forms and like nested fields on nested fields and finding, you know, categories and just those things that I would say UX designers and product managers have written their most tedious PRDs on and done their most detailed specs on. Like you don't actually have to worry about making that quote unquote usable because you can just brute force intelligence against it and and solve the

Starting point is 01:02:46 problem. And so I think that's interesting because the core value proposition can get bigger and bigger without being constrained by the surface area of a website or a UI or any of those things. And so I think if you're not thinking about what does that CLI look like for you and what adjacent systems does your product butt up against? It may be email. It may be some other dependency and how an agent might traverse those systems. You're just going to get less and less adoption because this is going to be more how people install products.

Starting point is 01:03:19 Yeah, and if I don't poke holes, and if I don't make a CLI that kind of bypasses some of the ways for the product works, somebody else will. You know, they'll just put their own agents on us and they'll burn more tokens. They might get frustrated. You may as well shortcut them and give them an interface which just works. It may not be the perfect interface, but that's the beauty of these things. You can get updated over time. You can, agents can just pull down the latest version. And yeah, like, hopefully I have something to show here, though.

Starting point is 01:03:49 Well, the other thing that I want to call out while you're talking about that, which is, as I'm watching this and it's taking some time to build, your conversion rate drop off point is somebody pressing the escape button and just saying, forget it. Like, this is clearly not working. What if we built it ourselves? And so I think it's a really interesting moment for product managers who right now are not getting the visibility of the drop off. Right. When you were going through a website, you could put telemetry. in it. You could say, okay, users going to the signup page, drop off, email verification drop off, going to the docs, drop off. You could build this nice little funnel that identifies where your users are having problems. You can put some telemetry in your CLI, but the end of the day, some of that drop off and the alternatives is very invisible to you here. And the switching cost,

Starting point is 01:04:40 quote unquote, is like pressing escape and saying, do it a different way. And so again, how quickly you can speed run to a zero to one installation in an agent, I think is something that everybody should be running right now. And it doesn't just have to be a code product. Like I think more and more people are doing non-technical tasks and interacting with non-technical SaaS in Claude code, in Claude co-work. And so, you know, even if you're not dev tools, if you're not thinking about how can a user do this quickly in a third-party harness or system or an agent can do this quickly, you're really missing out on customer growth. Okay, how are we doing? It's on its fourth attempt. That's fine. And you know what? Let's press the escape because you

Starting point is 01:05:34 know what? Let me tell you how cheap that exercise was. It was like five minutes and some tokens. and you're going to spin up a fresh clod code. I don't know if you put Make No Mistakes. That was probably what we missed. Make no mistakes. And it could have done it. And again, this is just learning. Like, why aren't, why isn't every engineer every PM doing this once a week or once a

Starting point is 01:06:00 month just to figure out how it can work? And it's great. So, Ryan, you've shown us everything. You've given us all, all the secrets. Let's get out of the tree. terminal and let's do some lightning round questions. So my first question for you is how does it feel? Because what I observe from our conversation is it feels fun. Like culture has in fact gotten better, not worse because of this investment. And so, you know, as a company that has really put in the

Starting point is 01:06:35 effort, both on the on the customer side and internally, how do you think it's shifted culture? Has it at all? What have you observed? Yeah, everything is just faster and more exciting. You know, I mentioned feedback loops a good few times and, you know, you can just get stuff out there so fast now. And I've been having the most amount of fun in my career over the last three months or something like that. And like, it's fun in many ways. It's fun because I can do stuff that, again, I would have had to convince other people to do or they were just things on my wish list and I could never get around to them. I just kind of complain about them. them. But now they're just realizable, but also the fun aspect of like making other people productive, like leveling people up, like removing work. I had like, uh, Intercom's pretty good culture around resisting like the kind of slow movements towards being a large company and all this process and stuff like that. We're kind of in denial that we're like a large company. I think it's a healthy way to work in many ways. And, but this has kind of got us back to our roots in a loss that you know you can make fast decisions and get them delivered and get that feedback

Starting point is 01:07:47 super fast and I've been able to like ship actual features like not just the CLI but I ship ship some webhuff features and it's been a long time since I've done that I'm just I've been in the weeds in platform space for a long time and but it wasn't even a big deal it was like just a couple of hours just kind of get something done it was like something a customer asked for so my job has become more varied. I'm able to kind of see more and get more done and help other people get a lot more done. So you get this kind of excitement and velocity increases and, you know, we have all those measurements and that's all kind of good stuff. But just the excitement of waking up and morning going like, I'm going to get a lot done today. Like that is a fun way to go about your day.

Starting point is 01:08:28 I completely agree. And I hear this over and over and over again. I certainly feel it myself, which is this is the, it brings me back to why I learned it learned to code. It's like that same moment. of I didn't learn to code because I like to type code. I learned a code because of the magic of you running like hello world and it shows up somewhere and that feels so it's just a very creative experience, which leads us to my second question, which is I see all the time that one of the most impactful change agents inside an engineering organization can be a senior principal engineer saying let's go ham on some AI code and the single most blocking person in the organization can be a senior principal engineer going, I don't believe it. Absolutely not,

Starting point is 01:09:12 not me, not here, no way. And in fact, last week, I heard a story of somebody who had their most senior staff engineer quit. It says, and I quote, I do not believe in AI. I will not work in a place that does this. So what is your appeal, sort of engineer to engineer of why to invest in this? Why you think it's the way that engineer organizations are moving and how you kind of come to meet skeptics where they are and hopefully see things a little bit more from where kind of Intercom is approaching them. I mentioned that Intercom kind of had it on easy modes. We didn't have to convince leadership that there's something to this AI stuff. We were pretty much, had decided the direction of the company, the weekends that Chat ChpT came out. So we already had this expectation

Starting point is 01:10:00 that this will be transformative across many parts of our work, including all of building products in engineering. We were just kind of mostly annoyed about how long it took. But I think for sure, it does need strong advocates. And you need to push boundaries. Like one of the biggest things that I've been able to do successfully was kind of push through the barrier of like, should we let an agent connect a snowflake? Like what like, and there's all these things can go wrong. Or should we let our agent run real production code in our Rails console over API? And the easiest thing to answer there is like, well, you know, I'm not sure. Or like this is risky.

Starting point is 01:10:41 Or we should think about this. But we've been largely pushing through it. And now like not recklessly. Like we've lots of good controls and we're a mature business. And we have like I've been on our security team, but definitely not trying to do anything too wild. But there's still, even then I have apprehension. It's like, I think I think we should do this. But it seems weird.

Starting point is 01:11:04 seems hard. But then I just have to give myself permission. And then I realize, if I have to keep myself permission, there's loads of people out there who just need me permission. And honestly, like, one of the biggest things I do at Intercom is just telling people they can do things. And there's a pre-AI and post-AI and or telling them like, look, whatever you do, just blame me if it all goes wrong. And I guess maybe we can blame Claude now, but ultimately it's that like permission and just like there's a level of ambition which comes from it as well. It's like if you if you're out there saying I'm not sure if AI is going to take or have a big role to play in all of our work. And you keep on saying that. That kind of will

Starting point is 01:11:45 permeate through the culture and people to say that. If you're very clear you say that like look all work is going to be agent first like at some stage in the near future. And so we're going to figure out the path there. And so we're going to break down every barrier as we come across them. And look, it's your job, it's my job. And if anything is wrong blame me. Like that's largely than how I've been approaching, but not just me. Like this has been a very large collective efforts, but giving that kind of permission thing, but also the kind of,

Starting point is 01:12:12 uh, like freedom to like explore or push things or whatever. It's kind of necessary and look. It might be a less stressful way to go about it to like just take a nap for a few years and come back and then with all the problems have been solved. And we've got these perfect agents, uh, running a muck in our environments. Then,

Starting point is 01:12:31 then that would avoid some of this. Like, I think all places have to get through that kind of apprehension and initial kind of issues that some of these can, some of the introduction of agents and industry environments can have. And I think our job as leaders, whether it's as an engineer or as a manager or whatever, just has to be on that like enablement and giving people space to to go deep on the work, enjoy it. And like, have that moment where things click and you start realizing like, oh my God, this is something that will transform how much I can get done. say it again for the people in the back. I love, I was like, oh my gosh, I love this so much. And, you know, it is absolutely those two things, which is like, give permission. You can. Please just go. Please, by all means. Go ahead. Designer, hit me with a PR. No one's going to get mad at you. Like, go ahead. And then the second thing of just accountability can roll to the top. And not in a scary way.

Starting point is 01:13:28 Let's not do irresponsible things. But, you know, I've seen a couple of things. I've seen a couple. incidents in the past months, some big ones. And what you see as CEOs or big leaders coming out and saying like the team's shipping and we want to keep shipping and we're going to be careful with our customer data and we care for the customer experience. And stuff happens. We've learned from it. It's ultimately on me. I'm going to call the customers and we're going to move on and deliver great innovation for you. And you know what I tell people to, you know, to get them over that hump, which is like, you really got to know what you're excellent. existential problem is. And I love what you said is the second that chat GPT came out, intercom changed,

Starting point is 01:14:09 because that is an existential problem. Who writes the code in your code base, agents or humans, not an existential problem? Like, will you be fundamentally disrupted by a new technology? That is the real problem in your business. So I always tell people, like, let's differentiate the real problems in our business from problems that we can tolerate and then go go use the problems we can tolerate to move fast. And so it sounds like you have a really good call. I mean, I think at the end of the day, the results speak for themselves. And again, you all are not asking me to say this. Intercom has meant the moment. You went all in on AI assisted, you know, customer support and experience. You're now building models. And so it's not just a one and done. Chad GPTs here, we need to change how our

Starting point is 01:14:54 product works or AI assistive coding is here. So we need to change how our engineering team works. It's, you know, models are going to be how people differentiate. We need to go there. CLEs are going to be how people use products. We need to go there. And so I think this sort of like fearlessness and what I would suspect is like just a fun, nice high trust culture, good people. You actually see the business results on the other side.

Starting point is 01:15:17 So I'm going to hype you up. I see a lot of teams. I see a lot of leaders. And I think people can take a lot of inspiration from this. But let's uninspire them really quickly before I get you out of here, which is my last question, which is when Finn takes. 15 solid minutes on a live podcast to do a very basic task that you know what can do. Or not Finn, when clog code.

Starting point is 01:15:39 Yep. What do you do? Do you yell? Are you a yeller? What does your, what does your meta analysis on this internal dashboard say? The human needs don't prove on. I do lapse into giving clog code like just like smiley faces or unhappy faces or, you know, not over the top. I certainly haven't cursed at it.

Starting point is 01:16:01 very polite That's kind of not my spoil but I do like the odd kind of like at a boy kind of smiley face and I don't know if it knows like that I'm deeply thinking about this and like these little subtle kind of hints or whatever but yeah no I think like

Starting point is 01:16:17 professional with a few emojis is my style with Claude and you know hopefully not will come back to me someday with an emoji same I waste the tokens on telling it it did a good job I somehow in my mind I'm like that's going into into its own sense of itself and it's going to know what good looks like. So I am there, I am there with you. All right, Brian, this has been one of my favorites. Y'all, if you have gotten to the end,

Starting point is 01:16:43 there is so much alpha in this episode. I cannot believe it. This is a cheat code to winning friends and influencing SaaS through AI engineering. Brian, where can we find you? And how can be helpful? I can be found on the internet as a nice vanity URL, which is Brom. and scanlan.i.e. And I got a few links here to some other talks and similar writing and different bits and bots. As you can tell, I'm not a designer. I asked Claude to design this as if

Starting point is 01:17:11 I was a Unix Systems administrator writing a little webpage and it kind of shows. I'm active on X Twitter and Brian underscore Scanlan. I'm on LinkedIn, Scanlan B or something like that. I think I'm the most famous Brian Scanlan on the internet. So generally, Brian Scanlan in, that tends to work. And I tend to be active and like showing up to different

Starting point is 01:17:31 conferences and just like getting good word out about what we do at intercom, mostly these days AI, but I've also given lots of talks about many other different topics. And yeah, I'm also a big believer in just saying yes to a lot of things. So if you look me up, you've got a good idea, you want to get in touch, you want to run stuff past me or whatever. Chances are I'll say yes. And we can, I'll just keep on doing this until things break and then I start saying no. But I'm still not there yet. So bring it off. Great. So search for Brian and ask him to do something for you. That's it. Well, thank you so, I mean, thank you, truly, for sharing all this information. People are going to get tons of value out of this. It's going

Starting point is 01:18:09 to be a hit for sure. And I just really appreciate you joining How IEI. Of course. This is so much one. Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at how IAIIPod.com. See you next time.

How I AI - How Intercom 2x’d their engineering velocity in 9 months with Claude Code | Brian Scanlan

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.