How I AI - “I haven’t written a single line of front-end code in 3 months”: How Notion’s design team uses Claude Code to prototype

Starting point is 00:00:00 The way I think about designing B2B SaaS is you want your designs to encounter reality as early as possible. I've always been into prototyping and then all of a sudden these AI coding tools come along and now I can prototype faster. I can prototype in production. So explain to us what this prototype playground is. It's just the next JS app. All of our prototypes are in one place. Seeing what other people are working on is really fun and interesting. And oftentimes you spot cool ideas and you're like, ooh, I want to try that.

Starting point is 00:00:27 The code is all in one place. It's just in one repo, and so I can just yank cool ideas from other people's prototypes and incorporate them into mine. Every time somebody is like a little anti-AI-assisted coding, I'm like, do you know that I used to have to walk uphill both ways for my CSS? It was not fun to do this. I mean, even just sitting here watching this, I still just find this magical. Welcome back to How I AI. I'm Claire Vow, product leader, and AI obsessive, here on a mission to help you build better with these new tools. Today we have a designer-centric episode with Brian Loven, designer at Notion AI, who's going to show us how he set up a prototype playground for the entire Notion design team to vibe code using ClaudeCode, any prototype they need.

Starting point is 00:01:17 This is a great one for someone looking to either shift their design organization into a code-first prototyping mode or learn some advanced techniques with ClaudeCode. Let's get to it. This episode is brought to you by WorkOS. AI has already changed how we work. Tools are helping teams write better code, analyze customer data, and even handle support tickets automatically. But there's a catch.

Starting point is 00:01:41 These tools only work well when they have deep access to company systems. Your co-pilot needs to see your entire code base. Your chatbot needs to search across internal docs. And for enterprise buyers, that raises serious security concerns. That's why these apps face intense. since IT scrutiny from day one. To pass, they need secure authentication, access controls, audit logs, the whole suite of enterprise features.

Starting point is 00:02:07 Building all that from scratch, it's a massive lift. That's where WorkOS comes in. WorkOS gives you drop-in APIs for enterprise features, so your app can become enterprise-ready and scale upmarket faster. Think of it like Stripe for Enterprise Features. Open AI, perplexity, and cursor are already using WorkOS to move fast. and meet enterprise demands. Join them and hundreds of other industry leaders at workOS.com. Start building today. Brian, welcome to how IAI. What I am so excited about in terms of our conversation

Starting point is 00:02:44 today is you're going to show us about how one of the best designed products out there, Notion, is being designed by people like you using these new AI tools like ClaudeCode. So, why did you make this shift to how you were doing design, what it meant to prototype design and build things, especially for a product and in a company who values design so highly? The way I think about designing B2B SaaS is you want your designs to encounter reality as early as possible. And if you imagine this gradient of like I'm scribbling on a napkin on one side to I'm shipping to production and showing customers on the other side. Our goal is designers is to move up that gradient towards prod as quickly as possible.

Starting point is 00:03:36 So I'd say for most of my career, I've sort of biased towards being interested in programming, mostly at the prototyping level. I just find that when you're designing something in Figma and then you actually try it in the browser, in the browser you notice a ton of problems. you know all of a sudden you're clicking things you notice loading states you notice oh that didn't quite work on this screen size so you encounter some version of reality sooner and you end up getting to a better design more quickly so you know i've always been into prototyping and then all of a sudden these AI coding tools come along and now i can prototype faster i can prototype in production

Starting point is 00:04:16 i can uh or what i most often do now at notion is just prototype in a little internal what we've built called prototype playground. And again, the idea is just like, how do we get something that's somewhat realistic in a kind of real environment, in our case the browser, as quickly as possible? And I think that just helps you move faster and end up with better designs. So explain to us what this prototype playground is and how you set it up and how you might use it. Okay, so prototype playground is nothing magical. It is just a next JS project. So I actually here, just source apps and there's an app directory. And you'll notice here in our app directory, we're normally in a NextJS app, you would see pages.

Starting point is 00:05:08 Well, we've just namespaced every designer on the team or PM or engineer, whoever signs up and wants to use it, can just namespace some directory. So here's Brian. And then every directory inside of that is some prototype. And so it's just a next JS app, but each page is sort of a standalone. There's no global layout. There's no global, I don't know, like structure that you have to adhere to. And so what that looks like on the front end is this.

Starting point is 00:05:37 This is what we call prototype playground. And it's just a list of prototypes ordered by who was working on stuff recently. So here's a few from December and then a bunch from November. And it's really cool because having everybody's prototypes in one place is useful in two dimensions. One, just from a visibility point of view, seeing what other people are working on is really fun and interesting. And oftentimes you spot cool ideas, and you're like,

Starting point is 00:06:01 ooh, I want to try that. And then on the other dimension is like, if you spot a cool idea and you want to try it, the code is all in one place. It's just in one repo. And so I can just yoink cool ideas from other people's prototypes and incorporate them into mine. Usually by just telling Claude to do that.

Starting point is 00:06:20 Yeah, I think before prototype playground, there was a lot of designers at Notion who prototyped in code. The difference was we were all creating our own repository, our own Nextjs instance. And so it was hard to know where everyone's stuff was. Everyone was rebuilding it in different ways. Or if people were trying to recreate something that looked Notion-E, we were all doing it from scratch. So anyways, Prototype Playground, NextJS app. All of our prototypes are in one place.

Starting point is 00:06:47 And then we have a few shared components and shared styles. So if you want to make something that looks notion-e, you can do that pretty quickly. So, for example, we have some templates here. I can show you like notion UI is just a notion-y sidebar. And actually, this isn't even very notiony. I think at some point I slipped this new button in here, which obviously doesn't exist in the product. I don't think these things do anything. But it's close enough.

Starting point is 00:07:14 If you're like, oh, I needed to prototype something with a notion sidebar, I can just come in here and duplicate this template. it. And then we, of course, pull in a bunch of our colors, typography, icons so that, again, just getting to close enough notion styles without a whole lot of effort. Yeah. And I want to call out for people a couple, many episodes ago, I showed how you could build a very similar next JS app for yourself that had a combination of docs you were working on, markdown docs you were working on, and prototypes in a very similar format where it was like, here's my folder of just stuff I'm working on. very minimal shared components, very minimal shared styles. I like this too because it's nice to have that team level organization so you can pop in and see who your teammates are working with. I have a question from an operational perspective. Did you set this up?

Starting point is 00:08:04 Was this like a passion project for you? Did engineering set it up for you? Like how did this actually get created? Yeah. I set it up with another engineer. I mean, it's just the next JS app. But then operationally, just a few approvals. It's deployed on Verselles.

Starting point is 00:08:19 So we had to go through a little bit of process to get that project spun up, get a few people added as members. Otherwise, yeah, it's not that much. Again, it's just a pretty basic Next.js app, which you can literally use Claude to, like, help me make a NextJS app. And it's just going to get you the default. I like keyboard hands. Everybody does the same keyboard hands motion where it's just this. I have one more question, which is of the people now working in this repository, how many many before were working in code versus this is their first repository that they've cloned

Starting point is 00:08:56 to their desktop or deployed. Was the design team pretty technically adept already? So this was very natural or were there some people that needed to be onboarded? I think so. I mean, to be honest, prototype playground is still really for me. Like I think I use it the most. You can see here there's a bunch of other people that are creating things. but if you were to go through, I probably use it the most.

Starting point is 00:09:19 I think there's maybe like five to ten people at Notion that use it quite a lot. And then a bunch of people who either have tried it and it didn't stick. And we can get into reasons why that is or they're just not interested or they prototype separately, right? Like we still have people prototyping in Figma. We have people that prototype in their own code base still. They just prefer their own stack. Maybe they don't like NextJS.

Starting point is 00:09:45 Maybe they don't like React. so they do something else. And I think all that is totally fine. In fact, one of the features I added recently was this ability to link to an external prototype. So if you prefer using V0 or a Levable or a Figma Make file, whatever it might be, you could just link to it here.

Starting point is 00:10:02 And in fact, this is what it'll show up as in Prototype Playground, just have this little external icon. And so you can click it and it'll open in a new tab. So in theory, this could be the prototype playground or repo for any prototyping tool. My hope is that over time, we make this thing useful enough that more people will want to prototype in it because it's just faster than those other tools. And we got to figure

Starting point is 00:10:29 out how to lower the onboarding complexity for people who aren't technical before. So to answer your question, I don't know, I'd say some people who weren't technical made their very first code prototypes or like AI-assisted prototypes in the playground. But probably the majority of us that are still using it daily had some technical background. Got it. Perfect. Well, let's prototype something. I want to see how it actually works. Let's do it. Okay, so there's a few ways to make new things in XTS, right? Like we could be in cursor and we could come in here and create a new folder and create a bunch of page.tsx and metadata files and that sucks. I don't want to do it. Um, so there's two ways around that. The first is when you're running in local hosts, you can actually just click this button that says

Starting point is 00:11:20 new and you give your prototype a name and a description. I'll call this one, um, how I, A, I. And then this is for fun. And I create that and all that's doing under the hood, if we bounce back over to cursor is it just created those files for me on my computer. This is my favorite part is like there's no back end for prototype playground. It's just all files on disk. And then we can just push all this to GitHub. So here we have like a little metadata file. These get sort of collated to render the list on the homepage. We have an actual prototype file here with some code. And then this is kind of nice. Like it automatically gives you a button to open in cursors. And now I can just come in here and start prototyping. Now typically my workflow is

Starting point is 00:12:08 I just bust open Claude in the terminal. I know this isn't how you're supposed to use cursor, but it's just how I do it. It's probably not even how you're supposed to use Claude code, but I just do it. We're just equal opportunity offending these two tools. I know. I know. Sorry, everybody. But this is how I like to work.

Starting point is 00:12:29 And in fact, I have a little shortcut here where I can just press caps lock G, and then I can get these two things side by side. in my computer site usually. I'm clotting over here, reviewing the changes here, and then monitoring sort of the output over here. So let's see here. I want to make a prototype and I don't know, let's just come up with some contrived example.

Starting point is 00:12:55 Like maybe you can help me think of a good use case. Can we make a prototype for, oh, like a little, a video and audio, this may be complicated, video and audio like display module for my my podcasts video and audio so it's show like video and then maybe like an audio player let's see you know it's it's opus four or five i think you can do it okay let's try it so normally uh let's walk through like my actual workflow there's sort of two steps one is you can type a lot uh that's not that fun um i do use this tool called monologue where you can just talk to your computer there's many products like this i think monologue is just nice and cute

Starting point is 00:13:34 um so we can just talk and it's just much faster than typing our prompt The second thing you'll notice with CloudCode is I switched over to plan mode. I think it's really, really important to plan before doing anything. For whatever reason, you just get better outcomes. Now, the key thing about using plan mode is to actually read the plan. And I think this is where having a development background just gives you an edge, because you can read the plan and be like, oh, that part actually doesn't look quite right. Whereas if you maybe don't have as much programming experience, it would be harder

Starting point is 00:14:07 to tell that. But in either case, I still find that having the plan mode and creating some structure before actually writing code is better. So let's just do both of these things at the same time. So we're in plan mode and I'm going to invoke monologue here and it's recording. And so let's say, I want to build a new prototype in this how I AI directory. And we are a podcast and I want to build a detail page for a podcast episode that has both a video player and an audio player underneath. The page should have the title of the episode, description, and how about if you hit play, there's little confetti that shoots up out of the player. And so we end that. And now I will delete this. And we plan.

Starting point is 00:15:03 So I have to give you props on two things. One, I am also a plan mode slash, like, write your spec, write your PRD person, obviously. I think the second thing is I am still just such a read the code, read the outputs, girl, when it comes to AI. It's actually one of my challenges when I use something like Claudecode or watch people use Claude Code is if you don't do it inside a cursor or something that gives you this sort of, I love your three pain window. your code window, your quad window, your output window. Because I see people with like 17 tabs of quad code going, just accepting a bunch of changes. And I have to read. I think this is also just like engineering development background where you can just spot things that make no sense in the moment as opposed to having to go back and debug something.

Starting point is 00:15:54 So I am very much aligned with you on that. Yeah, it's helpful. And you know, this is probably obvious to a lot of people who are familiar with using cloud code, but maybe if you aren't like another piece that's really important here is getting the right context up front, right? Like we just typed in some prompts. But under the hood, I can show you we actually have some other files helping us out here. So we have a clod.md file at the root of our project with just some rough instructions around like the tooling that we use, like we use. like we use Bun, we use Tailwind. It has like a rough outline of the project structure.

Starting point is 00:16:35 Another thing that we do is anytime someone runs the project locally, we create a clod.local.md.m.m.m.m. And that local file is not committed to the Git repo. So it's personal per computer. And it adds a little bit of extra context. Like, hey, this is my username in Prototype Playground. It tells Claude where my directory is. And it gives some instructions like, hey,

Starting point is 00:16:59 you know, don't go around and mess with other people's files, like prefer to work in my directory and a little bit more about the workspace structure and how like individual projects can be built. So a couple of those things are working under the hood here. And while you're accepting some of these Claude code changes and questions, I do want to call this out for folks because I think people are pretty aware of the Quad MD global settings, but I think people forget that they're actually locally scoped. versions of these that you can implement. And so it's really useful to get one version deployed to everybody that gives you your master rules for using Claude. And then you can set up your own

Starting point is 00:17:40 custom one with your own particular preferences. And I think that's a really nice way to create a good collaborative environment where people are using a similar AI tool or agent to work in the repo. Totally. Yeah. Okay. I don't know. We'll see how this goes, but it's going to install some sort of confetti. It's going to have a player, audio player. This is really awesome. It does a wireframe in the plan, which is crazy. And here, I don't know, we can just kind of skim this for the sake of this example. This looks fine. So let's auto-accept edits. Now, I have a tip for people, because I think when you spend enough time on Twitter or watching other people use these coding tools. People are always like, how do you get it to run for longer? You know, they find

Starting point is 00:18:31 themselves constantly getting stuck or the agent does the wrong thing or it's asking for their input. And my philosophy on this has been any time the AI asks you to do something, you should, before responding, try your best to see if you could teach the AI to answer that question for itself. There's a good example. Oh, wow, that was very fast. Ooh, la la. Well, here, let's hold on that and see if the confetti works. Well, actually, here, the example is I've already taught Claude to, like, always lint itself after it's done, right? Like, what's really annoying is when it builds a bunch of stuff and then you go and look in your browser, and there's some error, right? So, for example, I've taught Claude, hey, check your work. One,

Starting point is 00:19:19 you can run commands like, what was this, like ESLint, yes, Lint, right? So, for example, I've taught Claude, hey, check your work. One, you can run commands like, what was this? Like, ESLint, right? and look for actual typescript errors. The second is you can give it access to MCP tools. So Playwright is one. The Chrome Dev tools, MCP is another one. And you can say, well, actually, you know, before installing this, Claude would say to you,

Starting point is 00:19:38 hey, I've implemented your feature. Go take a look at it and let me know what you think. And remember, our rule is anytime Claude tells you to do something, ask if you can teach it to do that thing for itself. So I don't want to have to look at the browser every time to see if I did it correctly. So instead I teach Claude, actually you should be the one to go and open the browser. So it knows how to launch Chrome.

Starting point is 00:20:01 It knows how to navigate here. It knows how to click the play button, look for confetti, make sure the audio is working, all that kind of stuff. And so now we were able to run this task for much longer without my input and actually get to something that is working well. I'm actually very impressed with this prototype. It's much more lovely than I thought it was going to end up. much more robust and the confetti looks great. The confetti looks great. Yeah. Well, here, I'll show you another example. This is, I think, where the power of MCP gets crazy. So let's, let's actually clear this. We're just going to start a new, new conversation here. I'm going to just totally undo everything.

Starting point is 00:20:42 Let's just start from scratch. So a couple other things that I've built in, you know, I think remember, like, I'm trying to make the onboarding flow as simple as possible for people on my team. Yep. So what Claude has is called slash commands. And you can just build these yourselves and they're basically glorified prompts, but they can also run scripts. And so we have some slash commands in the project that help people get going really quickly. So I have one called Create Prototype. And then you can give it an optional name.

Starting point is 00:21:15 So we'll call this one, How I, AI. And that's going to do the same thing as clicking the new button on the browser, which is what we did earlier. The difference, of course, is I don't have to click things. I kind of want to design this so that I basically live over in the terminal. And can you show us really quickly in your repo just how these commands get defined? Perfect. Thank you. Yeah, sure.

Starting point is 00:21:40 So, again, it's basically a glorified prompt. It has a name, a description. and then some instructions. So in our case, we say kind of how to come up with a name based on what the user provides, tells it where to look to determine the current user's username, how to create the new thing. It actually provides some sample code to use for both creating the page and the metadata file. I think I need to also approve this so it goes.

Starting point is 00:22:14 Let's just do blank. now as well as creating the metadata. So, you know, AI is better with good context, but it's also really, really good if you just provided examples of how to do things. So the reason it's important to provide these code snippets is to show it what success looks like, right? If this was like just instructions to create blank files, it wouldn't know what to create. So in our case, We're just showing it an example of success. And we could probably simplify this. It's actually quite a long command, but here we go.

Starting point is 00:22:52 So it created this and a blank piece of text. That's great. So that's just one way to start. You just type slash create a prototype and then that'll create. But maybe we have some design in Figma and we want to build this. This might not work, but let's try it. So we can connect to the Figma MCP, and I can just copy a link to this frame and say, like, let's build this notion UI. So before you could just paste a link to a Figma URL and try and manually invoke the Figma MCP,

Starting point is 00:23:33 and it would sometimes ask clarifying questions, and sometimes it would build it and then sort of stop halfway through. I don't like any of that. So we actually built a command called Slime. Figma and it roughly does a couple of things. The first is it actually checks that you have the MCP server installed and running, you know, for people on the team who have never done MCP stuff before. They might not know how to do this. And so it detects if you have it installed. And if it doesn't, if it finds that it's not installed, it'll just teach you how to do it. So it actually returns instructions to the user on how to set all this stuff up. And then it moves on to phase

Starting point is 00:24:11 A, designing or extracting the design from Figma, then it will implement it. And then the most important thing is we enter this third phase called the verification loop where it's going to open the browser and compare the implementation it created to the original Figma file. And I think my instructions are basically keep looping until you've gone through like two loops where there were no more changes. Oh yeah, here. Repeat until the implementation matches or after three iterations with no changes.

Starting point is 00:24:41 and then stop iterating. So let's just see what happens. I would say it gets like 80% correct, 80% of the time. But that's just, that's just how AI is right now. I was going to say about 60%. So I think that was right. Well, actually, you know, I think it is 60%, but this command and this loop and these instructions and like the pairing of the two MCPs actually gets us to 80%. I want to call call this out for folks because one of the things that I find most frustrating using MCPs even as a fairly sophisticated user is one you just have to use these like magic keywords to invoke the MCP and the right tool and the right thing and you know sometimes I have one of the challenges I have is I have a lot of MCPs that use the same tool names because so much across SaaS is named the same like everybody

Starting point is 00:25:32 has the concept of projects everybody has the concepts of pages or documents and so I like this idea of like force invoking a specific MCP via a slash command and not even just force invoking that specific MCP, but force invoking a specific set of tools in that MCP. Super, super useful. And then I will give you props for the instructions at the top that teach somebody, if you have no idea what you're doing here, how do you even get this thing installed? That's such a nice piece to add in as user experience for a consumer of this slash command that might not be you. And so that's something that people should really, really think about. Yeah, yeah.

Starting point is 00:26:15 I would say also it's funny because I've actually watched a bunch of these videos and looking even back at the ones from six months ago, it's crazy how far the tooling has come. And so I imagine that people who, for whatever reason, might be watching this video in six months, will look at what we're doing here and be like, oh, how naive it, you know, We've come so far, MCP is no longer a thing or something like that, right? And I kind of feel that way now where MCP is, it's like not the best thing, but it's the best we have so far, right?

Starting point is 00:26:50 Like it's very context inefficient. Sometimes it runs forever. Sometimes it just like blows up your context window. But it's the best we have right now. So even just watching this, right, like here's our design that got built. This was literally just pacing the link to the Figma4. file, no other custom instructions. And now over here on the right, it should be, I think I ran into an issue earlier.

Starting point is 00:27:13 Yeah, something got busted with this. Let's try the Chrome Dev Tools MCP again. I think I quit it midway through because it was detecting some conflict with the window. Anyways, this is pretty good by default. And then from here I'd iterate, you know, some things you might notice would be like there's no hover states, some of these images are broken, but those are. just an easy follow-up tasks. Well, and you're doing this from a kind of design perspective, but think about how many

Starting point is 00:27:42 engineers sit there and like pixel pull over Figma prototypes into the front end. And, you know, if you have a great design system, maybe that's easier to do. But it's not what the 27 seconds that we just watched to scaffold stuff out. And so I just think, you know, the friction reduction in these, you know, asset. to asset handoffs, which for my entire career, 20 plus years in tech, have been the most expensive part of implementing something where a designer gives you a design and then you have to get into the front end or the front end has to be hooked up to the back end. All those little pieces can be smoothed out and done much faster. And then you can spend the time on the optimizations,

Starting point is 00:28:27 the performance, the how it feels, how it works. And I think that's just really, it's really fun from a builder perspective. Totally. It's so fun. And yeah, I mean, even just sitting here watching this, I still just find this magical, right? Like, now that it's using the Chrome Dev Tools MCP, they, like, looped and fixed the broken images and, like, created this checklist of stuff. Like, okay, everything appears to be right.

Starting point is 00:28:52 It's got this bottom bar. These things are obviously wrong, but we could go and fix those with the follow-up prompt. But again, the goal is, like, can we get 80% in literally one prompt? I just paste it a link and it just iterated itself to. something that's roughly complete. I know. And every time somebody is like a little anti-AI-assisted coding, I'm like, do you know that I used to have to walk uphill both ways for my CSS? Like, it was not fun to do

Starting point is 00:29:16 this. Like, I find this just mesmerizing. This is so cool. This is great. This episode is brought to you by Orkus, the company behind open source Conductor, the platform-powering complex workflows and process orchestration for modern enterprise apps and agentic workflows. Legacy business process automation tools are breaking down. Siloed low-code platforms, outdated process management systems, and disconnected API management tools weren't built for today's event-driven, AI-powered cloud-native world. Orcus changes that. With Orcus Conductor, you get a modern orchestration layer that scales with high reliability, supports both visual and code-first development,

Starting point is 00:30:02 and brings human, AI, and systems together in real time. It's not just about tasks. It's about orchestrating everything. APIs, microservices, data pipelines, human-in-the-loop actions, and even autonomous agents. So build, test, and debug complex workflows with ease. Add human approvals, automate backend processes, and orchestrate agentic workflows

Starting point is 00:30:26 at enterprise scale, all while maintaining enterprise grade security, compliance, and observability. Whether you're modernizing legacy systems or scaling next-gen-AI-driven apps, Orcus helps you go from idea to production fast. Orchus. Orchistrate the future of work. Learn more and start building at Orkis.io. That's ork-es.i-o. Are there any other commands that you think are super useful? Yeah, yeah, I can show you a couple of So I want to scroll back up a little ways, actually. There was this step very early on where you can see it was running over and over again

Starting point is 00:31:10 this skill called Bun Run Claude Skills find icon. What's that? Well, if you look over here in our design, we actually have a bunch of very notion-specific icons, right? Like we have this AI face, we've got home, inbox. We have all of the icons in our project. The problem is AI is really, bad at estimating what the name of an icon should be, or rather it uses like the most

Starting point is 00:31:36 obvious name possible, which doesn't always match what's in code. So for example, like this face icon, there's no way AI would know what we call this. Or a very common one is it will, if you have like a search magnifying glass, right, it will just assume that it's called search icon, when in fact in our code it's called magnifying glass icon. And so this icon hallucination was getting really, really annoying. So I wrote a little skill called Find icon, and the skill basically says, like, anytime you're going to implement an icon,

Starting point is 00:32:14 first go and actually look through the whole project, but also look for synonyms or closely related words to the icon. So if you're going to look up something called search icon, Also try to search for magnifying glass icon. And it actually wrote a typescript to do this to just iterate through all of the files in our icons directory, which is like 5,000, right? It's a lot. So it would actually be very inefficient for it to try and load all of that up into context. It needs to write itself a script to do more effective searching.

Starting point is 00:32:47 So in that loop here, yeah, you can see it like found, it looked up magnifying and found the magnifying glass icon. It looked up inbox and it looked up gear and trash in order to get all these things correctly. Now, this only, this skill had to exist after all of us on the team just got really, really frustrated with it hallucinating over and over and over again. It's sad because it obviously missed these bottom three. It didn't get them correct. But the fact that it got these on the first pass is a huge step up. So the way I think about it is, you know, we have these commands that you run manually.

Starting point is 00:33:25 and skills are these capabilities that the AI should detect automatically and sort of use at the appropriate time. And it'll know to do that based on the description and title you've given it. So in this case, find icon and then how to search for icons. And of course, the best part is just letting it do things programmatically on your computer by calling actual coded scripts. So this was really helpful, saves us a lot of time and just fixing imports. and nope, search icon does not exist, those kinds of annoying, knowing steps. Well, what I like about this is, one, this is exactly what you would do to like a junior designer or engineer onboarding. You would like explain.

Starting point is 00:34:08 You'd be like, sometimes we call it search, but not really. It's magnifying glass. You just got to go find like the closest synonym. And the ability to be able to describe that to an agent or a skill or a tool and then let it do it programmatically for you is really useful. we do have a how I AI episode on Claude Skills, but one piece we don't go into in detail, which I think is really important is Claude Skills can be bundled with scripts.

Starting point is 00:34:32 And so the ability to combine both, you know, natural language prompting, which is in the skill dot markdown, with a set of programmatic tools in terms of scripts is a very powerful combination. And Claude's very good at making these. So,

Starting point is 00:34:49 oh, yeah. Like all this, like I did not type, a single line of code in this, right? Like, this is 100% like, hey, I just need this problem to be solved, create a skill for it. And then creating that skill also create a script so that you can work more effectively. Like, this is 100% prompted.

Starting point is 00:35:04 Show us your last command because I think this is a really useful one. Okay. This is fairly new. I think I merged this last week. Going back to sort of the problem with prototype playground. It's still a Nextjs app. It's still React and TypeScript and Git and branches. and it's just a lot of concepts to throw at someone who maybe is used to only prototyping in Figma

Starting point is 00:35:28 or they're intimidated by a terminal or code. And so I'm trying to figure out like how do we make this thing more approachable? How do we make it easier to onboard? But also not dumbed down, right? Like I want people to learn how to use computers. I want people to even subconsciously absorb the ideas. of Git and branching and pull requests and merging. So I don't know the best way to do that,

Starting point is 00:35:56 but my first attempt is to create this skill called, or this command called deploy. And deploy does basically two things. The first is it like goes through prerequisites and makes sure that it has the GitHub CLI tool installed on your computer and that you're authenticated. And if you're not, it like walks you through those steps, how to do it.

Starting point is 00:36:18 And then the second step is it, It will just walk you through step by step, how to get this prototype you've just created, deployed so that you can share the link with someone on your team. Let's see what happens. I'm going to try it now. I'm going to hit deploy and we'll see what happens. There's a couple of really cool loops in here

Starting point is 00:36:41 that I think save people a lot of time. So we can see it going through the prerequisite steps here. It's making sure I'm logged into GitHub. Now the first thing here, look, It's looking to see if I'm on a Git branch. It notices I'm not. I'm on Maine. And it shouldn't be doing that, right?

Starting point is 00:36:57 Like, we never want to push to Maine. So I think what it should do is help me create a new branch. And we'll see if it actually does it correctly. It's also trying to find some TypeScript errors and it's going to run some tests. I basically told it to do all this stuff because it's really annoying if you push code to GitHub, wait for all the checks there to pass. If they fail, then you've got to come back to your computer, fix stuff. Okay, great. So it created a branch.

Starting point is 00:37:27 Now it's staging our changes. Nice branch name. Perfect. Creating the commit. I love this. This is a great idea. I will also give my just like hack to learning Git for anybody who hasn't used it. I just love the GitHub desktop app.

Starting point is 00:37:45 It just like it gives you buttons for all of this. You can see your divs. You can like create. branches with buttons. So I think this is awesome. And if you are intimidated by the command line, there's like literally a beautifully designed desktop app that you can use. That's true. That's pretty nice. We'll now check this out. So it's created the PR. And in the instructions, I've told Claude, hey, whenever you create the PR, open it in the user's default browser. So now we have our PR opened here. And this check to deploy it to Ressel will fail.

Starting point is 00:38:20 but that's okay because I give it one more step here, and all this red looks scary, but it's not. I tell Claude to just monitor the CI every 30 seconds or every 60 seconds until all of the checks pass, and I tell them the specific checks that I care about. And if any of the checks fail, just fix yourself and then push the changes. So, you know, if people push something to GitHub and there's a TypeScript error, they'll see some error over here in the GitHub UI, they'll take a screenshot, they'll send it to me on Slack and be like, why is my thing not working? I want to just avoid that entirely. And going all the way back to my first principles, like if the AI is asking you to do something, like check the PR or tell me the CI status, you should really be thinking about how do I teach Claude to just do that for itself. So over here, this slash deploy command,

Starting point is 00:39:17 Literally is just end to end, I just sit back and watch it loop over and over and over again, checking its commit status, its CI status, making sure everything works. And then when all of the checkmarks over here are green, the script will stop. I think this is pretty awesome. I feel, I hope it lowers the barrier and like the intimidation factor of having to learn all these tools. But at the same time, you know, if you are curious, you can. just sort of read along and understand what's happening. It's like instructed to communicate and clear English what it's doing. My favorite part of this and it's not going to be what people think.

Starting point is 00:39:57 I think the slash command is amazing. I think running through all the pre-prochacts. Great. I love that you just open it up in a browser window. It's one of those things that, you know, even if you created the branch, created the poll request, said it was ready to go. People are like, okay, well, now what do I, like, now what do I do? And just forcing open the browser window and saying, like, this is where it lives on GitHub. My question is, do you have to get your code reviewed in Prototype Playground? For Prototype Playground, no. I mean, people can always ask for it, but no, we pretty much just YOLO merge.

Starting point is 00:40:35 I think the thing that I mostly check for is, like, did my PR accidentally mess up someone else's prototype? type. But again, like, that happened a couple of times and that was annoying. So then we created this Claude Local file that's like, important. Do not do this. You know, and that seems to have fixed the problem. So yeah, a lot of YOLO enable auto merge. And of course, it's not perfect. I don't know. It seems to be hallucinating some stuff here. Like it thinks it thinks these past, even though they haven't. I don't know. It's close. So I'm just going to zoom out. Every that we went over, you created a shared repo for your entire team where you could have name level directories, no database. We're just using metadata JSON and shared code to put different prototypes

Starting point is 00:41:26 inside this repo. You've set it up with both global clod rules as well as local clod rules, plus cloud commands and clod skills to sort of guide people along common paths. My favorite one is going to be Figma to code. It's so cool. Beautiful. It's so good. And then the number one rule that I've heard from you today is when asked to do something by Claude, teach Claude to do it. It's, it's so. Yeah. So you have this amazing prototype playground. You've set all this stuff up. How has,

Starting point is 00:42:07 let's just do a couple lightning round questions and get you on your way. And my first one, is how is this shift from doing things, you know, maybe exclusively in Figma or in these lower fidelity prototype models to really leaning on things like Claude, code-based prototyping. How has that changed the design team? Has it changed a small part of the design team? Do you feel like overall things in the organization are shifting in a way? How do you feel like it's changing the way people work together? I still use Figma. I probably still spend 60 to 70% of my time in Figma. You know, there's just certain things that you're making that don't need to be in the browser.

Starting point is 00:42:53 They don't need to be coded up. You can just look at it and be like, yeah, that's roughly right. We should just ship that. I find that as you're designing for things that use AI, that is not true, though. So, for example, if you were building a chat, or in my case I work on Notion AI. I don't think you can design a good chat experience in Figma. You can design what the chat input looks like. You could design a little chat bubble and a send button and like a drop down for model picker. I think all that's fine in Figma. But what you can't design

Starting point is 00:43:34 in Figma is what it actually will feel like to use that thing. I probably should have said this at the very beginning. But the reason prototype playground existed is because when I started working on Notion AI, I was literally designing conversations in Figma. You know, it was like, the user's going to say this, and then the AI is going to say this, and then it's going to work

Starting point is 00:43:54 perfectly and create a page or a database. And like, you mock these golden paths in Figma, and then the engineers go and they build it. And then it just doesn't work that way, right? You send a message, the AI gets stuck, or it asks,

Starting point is 00:44:11 a follow-up question or it does the wrong thing and you need to correct it. And prototype playground was for me a way to connect to real AI models and just start feeling out like, okay, how are the models going to work if I submit this kind of prompt? What happens if I connect it to the notion MCP? Doesn't even know how to create a page. What happens if it runs into an error? Oh, right, we need to design an error state for this. What happens if the model is thinking for two minutes and the user's staring at an empty chat screen? Like, what should we do in that intermediary time to help them feel confident that it's working, that it's doing the right thing. Is there any way to show incremental progress? I just found those things very, very hard to design in Figma.

Starting point is 00:44:52 So to go all the way back to answering your question, I think as more and more people are designing apps that both are for AI or incorporate AI in some way, they're going to need some other like native code first way of working to actually understand what the models can do. It feels honestly kind of bad. It feels like a lot of wasted time where every month the whole freaking industry has to learn like, oh, what are the new capabilities of this model 4.3.2 max pro? And then a month later, it's all irrelevant because the new thing has come out. And then you learn that. it feels like a waste of time. Unfortunately, I think it's necessary because the model capabilities are still advancing quite steadily with each release.

Starting point is 00:45:42 And it's really important as designers to understand what models are capable of doing so that we can create product experiences and designs that sort of live right at the edge of what the model is going to be able to do well. What's really frustrating is if you design something that's like, you know, oh, a user's just going to ask for a cool. website and it's going to be this perfect output website on the other side, models can't do it, right? Or they require a bunch of fine-tuning and sort of intermediary prompting to get that right. Designers just have to know what's going on under the hood there to design something that's plausible and possible. So I suppose the more products incorporate AI, the more designers will have to shift to thinking sort of prototype first, but probably prototype first with actual code under the hood where you can incorporate modern models and see where they break

Starting point is 00:46:42 and see where they're good and see where they're bad and actually form an opinion about which models are good for which things, that kind of stuff. So speaking of which models are good for which things, and you're using my current fave, my babe, Opus 4 or 5. Why, why, I mean, why Claude Code, why cursor in this non-curcery configuration? Tell me how you arrived at this is your tool stack. I need to play with more of the cursor stuff. I actually think cursor agent mode is pretty awesome. I've like clearly tried it a little bit. I just haven't gotten that far. The thing that I still really appreciate about cursor, I actually technically use both. Like if I have, I don't know, like some file and there's, there's like some error here. I still really appreciate being able to just hover over the error

Starting point is 00:47:33 and there's a button that says fix and chat. That's still faster than like copying and pacing it down into Cloud Code. So I actually use both a little bit. I just think Cloud Code does the best work. I don't know how else to describe it. There's this weird feeling as you use all the different models for different things. It's like to different people they just feel right. For me, Opus 4-5 is just insanely.

Starting point is 00:47:58 good at doing what I want. And I like the way it approaches problems. I like the way it plans. I like the way it executes. I like the way it communicates back to me and the follow-up questions it asks. And then, you know, why not use Opus 4.5 in cursor versus in the terminal UI? I don't know. I think this is just purely personal preference. Like some people look at this and they're like, this looks like shit. Like, give me buttons and UI and components and drop downs and things like that. And for me, I don't know. This just feels nice

Starting point is 00:48:32 and easy to read. As our friends over at every say, each model has a mouth feel. Yeah, yeah, exactly. And Claudecote and Opus have a good one. Okay, and then my very last question, because you are, you seem like a expert prompter. But when AI is not

Starting point is 00:48:49 listening, when it's not listening, when it makes up, you know, CI checks that passed where it didn't, it didn't actually pass. What is your prompting technique? Basically, I notice there's a direct correlation with how good of things I can make and how tired I am. And if I ever get to the point where, man, Claude just sucks. It's doing the wrong stuff. And I go back and I reread the thing that I said. I realize I made no sense. And so the best solution for me to write better prompts is like, go to bed, try again tomorrow.

Starting point is 00:49:23 which I don't know if that's a coughout answer. It's not actually writing better prompts, but your outputs just directly correlated with how good of context do you give the thing. And if you're giving it sleepy, tired, lazy, please fix this type commands. It's going to do bad work. I don't know if this is what you intended, but you gave me very good, both relationship and parenting advice there, which I'm thinking about. I was trying to ask my kid this morning to do something. I'm pretty tired. And I clearly, the inputs were not going to get the outputs that I want.

Starting point is 00:50:02 Well, it's easy. I mean, just go take a nap. Can't you do that at any point that you need? I love that. You know one of my favorite little agents, Devin, does have a sleep. You can send the agent to sleep. Maybe we just need the agents to send us to sleep. Well, Brian, this has been awesome.

Starting point is 00:50:19 Just a deep dive. I think in a very forward-looking view. into how design teams, as you say, especially ones that are going to be building AI products, are going to start doing their work. So where can we find you? And how can we be helpful to you in Notion? You can find me.

Starting point is 00:50:34 I'm mostly on Twitter or X, Brian underscore Levin, or my website, Brianleven.com. And then I work on Notion AI. And I think it's genuinely one of the few useful sort of knowledge work agents. So if you haven't tried it, try it and send me feedback. We're always trying to make it better, help it do more things better, faster. So try Notion AI.

Starting point is 00:51:01 And we're big fans of Notion AI too here at the podcast. So definitely give it a look and definitely get some feedback. And we will send it directly to Claude and put it in Prototype Playground. Brian, thank you for joining How IAI. Thank you for having me. Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube. or even better, leave us a comment with your thoughts.

Starting point is 00:51:24 You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at how IAIIPod.com. See you next time.

How I AI - “I haven’t written a single line of front-end code in 3 months”: How Notion’s design team uses Claude Code to prototype

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.