The Pragmatic Engineer - From Software Engineer to AI Engineer – with Janvi Kalra

Starting point is 00:00:00 You interviewed at 46 companies. What did you learn about what the market is like, what interviewing is like, what the whole scene is? In terms of the space, there are the product companies, infrastructure companies, and the model companies. I found it helpful to put companies in each category and figure out which segment you're most excited about to help narrow down the options, given that there are so many AI companies right now. Product companies are the companies building on top of the model. Here I think of Cursor, Codeium, Hebbia. Infrastructure companies are the companies building the tools to help AI product companies effectively use LLMs.

Starting point is 00:00:35 So there's a whole suite of these. There are the inference providers like Modal, Fireworks, Together. Vector database companies like Pinecone, ChromaDB, Weaviate. Eval and observability tools like Braintrust, Arize, Galileo, and a whole other suite of products. And then there's the model companies, which are the base of the ecosystem building the intelligence. You have the big tech companies like Google, Meta, building models. And then you also have startups like, or not startups, you have other smaller companies like OpenAI, Anthropic building models as well. So that's how I kind of think about it.

Starting point is 00:01:05 So for me, in trying to be more focused in my search, I decided to focus on model and infrastructure companies because I wanted to keep getting breadth in what I was doing. And I felt like the product companies were too similar to my experience at Coda, which was phenomenal, but I wanted to keep growing. The tradeoff was that it was a bit more of an uphill battle because the work that I had done was not as relevant to model or infrastructure companies. What is AI engineering and what does it take to get hired as an AI engineer? Janvi Kalra is a software engineer turned AI engineer with four years of experience, but already a very impressive background. During college, she joined a university incubator where she shipped four mobile apps to production to paying customers. She then interned at Google and Microsoft, joined Coda as a software engineer, became one of the first AI engineers at the company, and then interviewed with 46 different AI startups and now works at OpenAI.

Starting point is 00:01:53 In today's conversation, we cover Janvi's decision-making when she decided to join a startup after graduating, when she already had return offers from big tech companies like Google and Microsoft. How Janvi became one of the first AI engineers at Coda, despite being told no when she first volunteered to join Coda's in-house AI team. What Janvi works on at OpenAI and why she thinks OpenAI moves so fast. And many more topics. If you're interested in AI engineering or how to make that transition from software

Starting point is 00:02:20 engineer to AI engineer, this conversation has lots of relevant tips and observations coming from Janvi. If you enjoy the show, please subscribe to the podcast on any podcast platform and on YouTube. So, Janvi, welcome to the podcast. Thanks for having me, Gergely. You were at a good college, Dartmouth, but you then got an internship at Google. Now, not everyone studying at Dartmouth can get into a place like that; it was very competitive.

Starting point is 00:02:43 How did you get that internship and then the Microsoft internship? What was the interview process like? And what do you think helped you get your step in the door, basically? Back then, I didn't know anyone at Google or Microsoft. So I applied through their portal and I remember for university students they asked you to write essays on why you want to work there. So I remember in those essays

Starting point is 00:03:04 talking about the things that I had built outside of classes as well as why I wanted to work there in particular. I was lucky to get picked up from the stack, to be honest, and then LeetCoded to prepare for their interviews. So tell me about, like, preparing with LeetCode. I mean, these days it is somewhat common and known, but there are, you know, two sides of people.

Starting point is 00:03:23 Some engineers or college students, they eye-roll, saying, oh, this is, you know, pointless, it's not the job, et cetera. And then some people just, you know, like, it sounds like you just kind of like went through it, studied it, prepared it. How did that go? So for Google, that was in my sophomore year. And I remember being surprised that I even got an interview in the first place. So I wasn't studying actively before.

Starting point is 00:03:46 When I got the interview, I asked a couple friends, what do I study? I think they sent us a pamphlet of things to look at. And in that pamphlet, there was that green book, because back then, NeetCode wasn't a thing. There wasn't Blind 75. And so that green book, I'm forgetting what it's called. But I remember buying that book, locking myself in my room for three weeks. Cracking the Coding Interview, probably. Yes, Cracking the Coding Interview.

Starting point is 00:04:13 And just reading as many questions as I could back then. Yeah, you have it. Yeah, I'm cracking the book. I even have a version of it, which was the white book, that was like 10 years ago. But the author of this was actually a Google interviewer, like, I think 15 or so years ago, Gayle Laakmann McDowell. And now she, actually I'm not sure if she still does it, but she used to run training programs at companies. At Uber, she came and she ran our training program on how to do coding interviews, what kind of signals to get, how to change it. So it's actually

Starting point is 00:04:48 really nice because she teaches the companies how to do it, so then she can update the book and actually have an up-to-date view of how it works at different companies. Wow. She definitely changed the game in that I'm not sure how much things were written down before that. So I think she definitely paved the way for NeetCode and other people to build on top of this. Yeah. If you want to build a great product, you have to ship quickly. But how do you know what works?

Starting point is 00:05:13 More importantly, how do you avoid shipping things that don't work? The answer: Statsig. Statsig is a unified platform for flags, analytics, experiments, and more, combining 5 plus products into a single platform with a unified set of data. Here's how it works. First, Statsig helps you ship a feature via feature flag or config. Then, it measures how it's working, from alerts and errors, to replays of people using that feature, to measurement of top-line impact.

Starting point is 00:05:43 Then you get your analytics, user account metrics, and dashboards to track your progress over time, all linked to the stuff you ship. Even better, Statsig is incredibly affordable, with a super generous free tier, a startup program with $50,000 of free credits, and custom plans to help you consolidate your existing spend on flags, analytics, or A/B testing tools. To get started, go to statsig.com slash pragmatic. That is S-T-A-T-S-I-G dot com slash pragmatic. Happy building. This episode is brought to you by Sinch, the customer communications cloud trusted by thousands of engineering teams around the world. If you've ever added messaging, voice or email into a product, you know the pain.

Starting point is 00:06:22 Flaky delivery and platforms stacked with middlemen. Sinch is different. They run their own network with direct carrier connections in over 60 countries. That means faster delivery, higher reliability, and scale that just works. Developers love Sinch for its single API that covers 12 channels, including SMS, WhatsApp, and RCS. Now is the time to pay attention to RCS, Rich Communication Services. It's like SMS but smarter: your brand name, logo, and verified checkmark, all inside the native messaging app. Built by Google, now rolling out with Apple and major carriers, RCS is becoming the messaging standard.

Starting point is 00:07:00 Sinch is helping teams go live globally. Learn more at sinch.com slash pragmatic. That is S-I-N-C-H dot com slash pragmatic. And so how did your internships go at both Google and Microsoft? It must have been really exciting to, like, get it. Google was the first one, right? It was a phenomenal experience. And it was exciting for a couple of reasons.

Starting point is 00:07:24 First, it was just a privilege to get access to these code bases of places that I admired. I remember when I was at Google, I was on the search team. And I would use MoMA, their internal tool to find documents. And so I remember so many weekends where I was just trying to find company documentation on how the search algorithm really works or comb through the code beyond the code that I was touching to get a sense of, well, what makes Google tick? So from an intellectual perspective, it was just very exciting back then. Second, you also learn a lot of technical things that you don't get exposure to in college.

Starting point is 00:07:57 Like how to effectively operate in a large code base, the importance of writing unit tests. Now when I look back, it's trivial. But for a college student back then, I remember they were very important learnings that I really value getting back then. And to me, my favorite part was having access to people that were five or ten years ahead of me in their career. I remember over coffee chats asking many of them, you know, what career advice do you have? What are things that you loved in college that I should do more of? And some of the advice that I got really shaped decisions that I made. So that was my favorite part of the internships. I would say in hindsight that given that big tech and startups are such different experiences

Starting point is 00:08:41 and you learn so much at each, it would be more educational to do one startup internship and one big tech internship to get a very robust overview of what both experiences are like very early. So like looking back now that you've done both Google and Microsoft, they were somewhat similar-ish. Is it safe to say?

Starting point is 00:08:59 I mean, at the high level, right? Like we know every company and every team was different. Yes, at a high level. What was different is I wanted my junior year to work on operating systems because at that point, I'd just taken a computer architecture class And I loved it.

Starting point is 00:09:12 And so I wanted to go deeper in the stack. So from a technical perspective, they were very different. But from an experience of what do companies look like and how do they work, which is a huge part of an internship, that part was similar. So what did you work on at Microsoft? Was that OS? Yeah, I was working on OS specifically. I was working on the Azure OS team.

Starting point is 00:09:32 It was a product that lets you interact with Azure Blobs locally from your file system. So it hydrates and dehydrates those blobs. You can think of it like Dropbox for Azure Blobs. Yeah. Nice. That is so cool. I mean, both that you decided that you want to do something a lot less conventional, you know, like not the usual SaaS apps or web apps or whatnot. And that you were able to make it happen. Did you express this preference when you got the internship? Yes. I remember talking about my computer architecture class where we built up a computer from transistors and conveying how mind-blown I was from that experience and how I really wanted to

Starting point is 00:10:11 work on operating systems. And then I was lucky that they put me on that team. That's awesome. But I think there's a learning here of, like, you don't ask, you don't get. So again, I just remember when I set up our first internship at Uber in Amsterdam, so for that site. And, you know, once we made an offer to interns, like, you go through the interview process. But I also asked people if they have a preference. And most people just do not have a preference. So there is this interesting thing that if you do express your preference, again, worst case, you know, you'll get whatever it would have been. But from the other side, a lot of people often don't speak up.

Starting point is 00:10:46 And, you know, the people who are at these companies, they really want to make this a win-win, especially for internships. The goal of an internship is to have a great experience and companies would like you to return. It goes both ways, right? They evaluate you, but you also evaluate them. So they will actually do it. It's just a really nice learning.

Starting point is 00:11:02 Like, yes, express what you're hoping for and it might just happen. Yeah, and these companies have so much IP and so much that we take for granted today, but are really hard technical problems that they have solved. So it's just a treat to then go work on something that you admire and get to actually see how that code works. Absolutely. Once you're in there, these companies are so amazing with how big they are, especially as an intern, a lot of doors are open. You can also just ask and they'll be super happy to do.

Starting point is 00:11:32 So then you made a very interesting decision, because now you interned at Google, you interned at Microsoft. A lot of students or new grads would be super happy with just having one. As I understand, you could have returned to either. And then you made the decision to not do that.

Starting point is 00:11:50 Why? Google and Microsoft, you loved the teams. Tell me about how you thought about the next step of what you would like to do after you graduate. So I told you how I was having coffee chats at Microsoft, during my junior internship. A bunch of mentors mentioned that startups are a great experience as well. So I started to evaluate the big tech versus startup option.

Starting point is 00:12:13 And I don't think it's black and white. I think there are really good reasons to go to both. The way I saw it, the upside of going to big tech was first you learn how to build reliable software for scale. It's very different to build something that works versus build something that works when it's swarmed with millions of requests from around the world and Redis happens to be down at the same time. Very different skills. So that was one upside. A different upside for big tech in general was that you do get to work on more moonshot projects that aren't making money today. They don't have the same existential crisis that startups do. And so they can work on things that, you know, great AR/VR research is happening. Back in the day, I think Google was one

Starting point is 00:12:58 of the best places if you wanted to do AI research. There are also good practical reasons to go to big tech. I'd get my green card faster. I'd get paid more on average. And the unfortunate reality, I think, is that the role does hold more weight. People are more excited about hiring an L5 Google engineer versus an L5 from a startup, especially if that startup doesn't become very successful. With that all said, though, I think there are great reasons to go to a startup. And back then, this was hearsay based on what I heard from mentors. But now having worked at a startup for three years, I can confirm it's indeed true. First, you just ship so much code, right?

Starting point is 00:13:36 There are more problems than people, and so you get access to these zero-to-one greenfield problems that you wouldn't necessarily get at big tech, where there may be more people than problems. Second is the breadth of skills, and this is not just in the software engineering space. Right, from a software engineering space, maybe one quarter you're working on a growth hacking front end feature

Starting point is 00:13:56 and the next quarter you're writing Terraform. But even in terms of the non-technical skills, you get an insight into how the business works, and you're expected to PM your own work. So there's so much breadth over there. And you just get more agency in what you work on. You get the opportunity to propose ideas that you think would be impactful for the business and go execute on it. So that breadth and learning opportunity to me was a huge upside that got me very excited about startups.

Starting point is 00:14:19 It's just so nice to hear you summarize this, because the reality, what a lot of people do, is they go to one company or the other, either big tech or startup. And then they're there for a long time, and then one day they might switch, but there's a lot of, like, sunk cost fallacy, you know, like you're used to this. So some people actually, after a few years, they might go back to the same type of company. And so I think there's a few,

Starting point is 00:14:41 there's relatively few people who see, you know, in such a short and focused time, the different upsides, like you have. And as you said, so it sounds like the upsides did happen. So you went to Coda, right? Yes, I did go to Coda.

Starting point is 00:14:56 And then how did things go? So you mentioned some of the upsides, I assume that all happened there. But, you know, what other things? Sounds like things sped up there, actually, from a professional learning and also career experience. Definitely. I went there for growth and breadth,

Starting point is 00:15:12 and I definitely got that in terms of the opportunities that I got to work on. So it was a phenomenal experience. And I'm happy to dive into the specific work I did, but overall just a phenomenal experience. But before we do, before the podcast, we talked a little bit about how you thought about selecting a startup, because, like, you did go to Coda, but as I understand, this was not just, I'm just

Starting point is 00:15:33 going to, you know, like, oh, this looks like a good startup. You actually thought about how to select a potentially great startup that would have that kind of growth potential. What is your mental model? And how did you evaluate, and how did you kind of, you know, go about ranking the startups? What was your application process? So back then, I didn't have startup experience and I also went to a school on the East Coast where not many peers around me were going to startups.

Starting point is 00:15:59 So I very much looked for where are places where I love the people in terms of them being smart, people I can learn from, as well as being very passionate about the product, because I think you do your best work when you are passionate about what you're building. So I definitely looked at it through those two lenses. Today, after having been in Silicon Valley, though, for four years, I have a more robust rubric on what I look for. So that's definitely evolved since then, because one thing that's become super clear after living here is that your career growth at a startup is very contingent on the startup growing. So then how do you choose which startup is going to grow? And that's a hard question. You know, venture capitalists spend all their time thinking about this.

Starting point is 00:16:43 And today, what is your mental model? Or for someone who has a few years of experience, a bit like yourself, what would you advise them on how to think about different categories of startups, the kind of risk, the upsides and so on? There are startups of all different sizes, and the smaller you go, the more risk there is. I think that's part of the game, and that's what makes it exciting because you also have more upside when there's more risk.

Starting point is 00:17:11 That being said, I feel very strongly that all engineers that take a pay cut to go to a startup should have an informed thesis on why they think that company is going to grow during their tenure. And how to actually assess growth is a hard question with no right answer. But my current rubric is looking for four things.

Starting point is 00:17:32 First, high revenue and steep revenue growth rate. Second, a large market where there's room to expand. Third, loyal, obsessed customers. And then fourth, competition: why this company will win in that space. And I'm happy to go deeper into any of those. But that's at least how I think about assessing different startups today. And it's all relative, because a startup that is pre-PMF will have less revenue than a startup that is a Series D with 400 people.

Starting point is 00:18:01 And then when you're, like, thinking about these four different things, so like we'll later get to your actual job search as well, but do you, like, try to find these things? So for example, you mentioned one thing about customer obsession, right? Like how much customers love it? Like let's say there's a startup that you're kind of interested in.

Starting point is 00:18:22 How do you evaluate that? Do you kind of look it up yourself? Do you put in the work? Do you try to somehow outsource it? What worked for you? Because there's no right answer here, I think it's really important to do the due diligence yourself because you're going to be the one that's responsible for your decision here, good or bad. How I think about things like customer obsession is I look on Reddit and YouTube to try to find real users. For more SaaS-type companies

Starting point is 00:18:47 where you may not have customers writing about the product online, I'd actually find companies that use that product and then go try to talk to them and understand from the ground what people think about this product, especially if this is a product that I can't use myself because it's not for consumers, but for businesses instead. I love it. And again, I don't think enough people do this kind of due diligence, and they should. You know, one, I guess by now, famous example is Fast, the one-click checkout startup, where they recruited, actually, there were some ex-Uber folks there who I, like, knew to some extent, but a lot of people were recruited with a shiny diagram that showed headcount growth.

Starting point is 00:19:26 And a lot of people did not ask about revenue, or when they did, they were okay not hearing about it. And even the people who worked there for a while, they ignored it. And there were some people who actually asked about it and they actually realized that something was off. But just following your framework, for example, some people who are a bit more diligent could have avoided it. The same thing with customers, for example. There were not many. And like one learning that I had back then, in talking with engineers who were there and got burnt, they all told me, I wish I would have done a bit more due diligence and not taken the CEO's word for it, but also asked for proof, same thing, revenue, runway, those kind of things. Yeah, I feel like, you know, at startups we're paid in equity, a large chunk. And so we're investors. So you have the right to all this information. And to me, if a startup's not giving you that information, that is a red flag in and of itself. Yeah, I feel maybe people should think about that if you join a startup a bit, like,

Starting point is 00:20:21 but like if you put in like a bunch of your money, like a significant amount of your savings. And when I did angel investing, if I didn't get information, I mean, you can still put it in and you can hope for the best, but I think that's called gambling, to be fair.

Starting point is 00:20:34 It is. And so that's okay. But then just be honest with yourself. Like if I'm not getting this information, I'm gambling my most valuable time and very valuable years of my life. And that's okay, right? It could work,

Starting point is 00:20:45 but it's maybe not the smart way. Exactly. And as engineers, when we're recruiting, we're LeetCoding, we're doing system design. It's hard to carve out that time to do diligence. And so it's something I think we don't talk about enough.

Starting point is 00:20:56 I will say that as a hiring manager, even as a manager, when you join a company and if you've previously done your due diligence, you will have a better start. People will remember you, saying, oh, this is this person who actually cares about the business, cares about where it's going, cares about how they can contribute. So on day one, you're already not just seen as like, oh, you know, like new starter XYZ, but like, oh, like this person has drive. Like I think that comes across.

Starting point is 00:21:21 And honestly, if a company is not happy with you just trying to understand the business and seeing how you can fit in, it's probably a red flag in itself. Let's be real. Yeah, that's true. That's fair. So at Coda, you joined as a software engineer and you then transitioned into, you know, I looked at your LinkedIn, AI engineer. How did that happen?

Starting point is 00:21:39 And how did you make that happen? Because it sounds like you actually had a lot to do with it. So if we rewind to the end of 2022, that was when ChatGPT came out. And Coda was starting. Oh, yeah. Yeah, big milestone. And, you know, Coda saw the amount of love that this product was getting. And Coda was starting an AI team with two engineers to build an AI assistant to help you build your Coda documents.

Starting point is 00:22:03 At that time, I asked, hey, I'd love to be a part of it, and got a very polite no. So I thought, no problem. I'm just going to start building in this space anyway in my nights and weekends because this technology is very cool. The first thing while I was learning was trying to answer to myself, how does ChatGPT even work? And through that went down a rabbit hole of self-studying the foundations of deep learning. So starting off with the very basics of what's a token, what's a weight, what's an embedding, to then understanding that, okay, LLMs are just next token prediction,

Starting point is 00:22:37 going through the history of different architectures, of how we went from RNNs to LSTMs, and then building my way up to the transformer and understanding that, okay, it's positional encoding and attention that have allowed us to scale up in such a good way. What this did for me was to just give me some intuition of how this technology works, which gave me a bit more confidence. After having built that foundation, I wrote about it in my blog, and so my team was aware that I was working on this as well. I started to build on top of these models.
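The three ideas named here (next-token prediction, positional encoding, attention) can be shown in a few lines. What follows is only a minimal pure-Python sketch for intuition, not how any production model is implemented: the function names are hypothetical, the positional encoding is the sinusoidal variant (an assumption, since the conversation does not say which kind), and the attention handles a single query vector with no learned weights.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def positional_encoding(position, dim):
    """Sinusoidal position vector: sin on even indices, cos on odd ones."""
    vec = []
    for i in range(dim):
        # Each (even, odd) index pair shares one frequency.
        angle = position / (10000 ** ((i - i % 2) / dim))
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec

def attention(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    # Output is the weighted average of the value vectors.
    mixed = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return mixed, weights

def next_token(logits, vocab):
    """Greedy next-token prediction: pick the most probable token."""
    probs = softmax(logits)
    best = max(range(len(vocab)), key=lambda i: probs[i])
    return vocab[best], probs[best]
```

For example, `next_token([1.0, 3.0, 2.0], ["cat", "dog", "fish"])` picks `"dog"`, and `attention([1.0, 0.0], keys, values)` puts more weight on whichever key points in the same direction as the query.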

Starting point is 00:23:07 So I went to a lot of hackathons. My favorite one was a way to learn languages while watching TV, because that's the way that I learned Hindi and I wanted a way to practice my Mandarin in that way. When I was building and doing hackathons, I got a sense of how to actually use these tools. So after five months had passed, when I asked again to join the AI team, I got very much a heck yes, come join us. We see that you truly care about this because you've been working on it in your free time. And that's when the real learning started, because hacking on top of models in your free time is very different from trying to build for production, especially because as engineers, our role is to build reliable systems, but you're dealing with stochastic models.

Starting point is 00:23:51 So they're very much at odds with each other. And when you say hackathons, were these like weekend hackathons, you know, the ones that anyone can attend, you register? And like especially they were popping up because of the AI, you know, like hype basically starting. Yes, weekend hackathons. I also did, so the project I was telling you about that was a language learning tool, that was an online hackathon for six weeks with this company called Buildspace. Anyone can go join.

Starting point is 00:24:19 And the way you win in this hackathon is not by what you build, but how many users you get or how much revenue you're generating. So it's such a fun way as an engineer to not just build something, but actually go and try to get people to use it. So it was a very fun learning experience. And because it was all online, they really encouraged us to build in public. And that in and of itself was a great learning.

Starting point is 00:24:41 I love it because I think a lot of times when a new technology comes out, for a lot of engineers, especially people who have a day job, like you had, the biggest thing is like, hey, I can build something on the side, but what should I even build? I mean, you know, it feels kind of pointless. You can do the tutorials, but especially in this case, there's not many

Starting point is 00:24:59 and tutorials are kind of not there. So I love how you found a way to have a goal, to enjoy it, to do, you know, scratch your own itch as well and combine it. So like maybe these online hackathons or like hackathons happening around you, it could

Starting point is 00:25:15 be a great way to do it. And it sounds like it actually helped you professionally, like, it helped even your company and your job, because now knowing how to use these tools was very much in demand. It still is. But there were not many people who were as enthusiastic and as self-taught. One thing that I learned from that experience was don't wait for people to give you the opportunity to do something, just start working on it. I love this. This is such a good mindset. So when you joined this team, so technically did you become an AI engineer? What do you think an AI engineer even is? I feel this is kind of an overloaded term. So I'd just love to hear how you think about it. An AI product engineer is building products on top of models. And the work

Starting point is 00:26:00 entails first, a lot of experimentation: this new tool came out, experimenting with what you can build to solve real customer problems, prototyping it. And then from there going to actually building it for production. So at its core, it's very similar to software engineering. There are some domain-specific things like learning how to fine-tune, learning how to write good prompts, learning how to host open source models. But in and of itself, the foundation is very much software engineering. Yeah. And I guess, you know, I guess evaluation is also a big one. Yes, that's a great one. Writing good evals. And then like one thing that was very surprising for me to learn when I talked with a friend who works at a startup is how their test suite costs money to run every time. The eval

Starting point is 00:26:47 suite, they're like, I don't know, like how many, like $50 or something like that. And it's like, oh, you know, when I run my unit tests, like it costs time and effort. But it's free. It's just time. And now you actually, especially if you're using an API, you have this cost, which is I think refreshing and just a good way to think about it. And it just forces you to adapt. Yeah, for sure. It's very interesting because there's no good way to measure the accuracy of a non-deterministic model without using LLMs. And so at Coda, we used to use Braintrust. And it was so interesting in how the model is being used to check whether or not it's working correctly.
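To make the idea of a model grading a model, and of an eval suite that costs money per run, a bit more concrete, here is a small sketch in Python. It is not Braintrust's API or Coda's setup: `EvalCase`, `keyword_judge`, `run_evals`, and the per-call price are all hypothetical, and the judge is a cheap keyword-overlap stand-in for what would really be a second LLM call.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

@dataclass
class EvalCase:
    prompt: str
    reference: str  # what a good answer is expected to contain

def keyword_judge(prompt: str, answer: str, reference: str) -> float:
    """Stand-in for an LLM judge: share of reference words found in the answer."""
    wanted = set(reference.lower().split())
    found = set(answer.lower().split())
    return len(wanted & found) / len(wanted) if wanted else 0.0

def run_evals(
    cases: List[EvalCase],
    model: Callable[[str], str],
    judge: Callable[[str, str, str], float] = keyword_judge,
    cost_per_call_usd: float = 0.0,  # assumed flat price per API call
    pass_mark: float = 0.5,
) -> Dict[str, float]:
    """Run every case through the model, grade it, and tally the API calls."""
    passed = 0
    calls = 0
    for case in cases:
        answer = model(case.prompt)
        calls += 1  # one call to the model under test
        score = judge(case.prompt, answer, case.reference)
        calls += 1  # one call to the judge (an LLM in a real setup)
        if score >= pass_mark:
            passed += 1
    return {
        "pass_rate": passed / len(cases) if cases else 0.0,
        "calls": calls,
        "estimated_cost_usd": calls * cost_per_call_usd,
    }
```

Because each case triggers two calls, a suite of a few thousand cases against a paid API adds up quickly, which is the point made above about test runs no longer being free.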

Starting point is 00:27:29 As you're just going deeper and deeper into the AI field, like, what were resources that helped you? Was it just pure self-learning? Was it going to the source, where the papers are? Like, this is really an ongoing question because, you know, the industry is not slowing down and there's not many kind of books or like, you know, static resources out there. Yeah, very fair. Because things are changing quickly and there weren't static resources at that time, and that's still true today, I found it most helpful to learn by just doing. So even when I was on this team, I'd go to a lot of hackathons, internal to Coda and external. I remember there was an internal hackathon at Coda, where it happened to line up with the day OpenAI released function calling for the first time. And so our team, we played around with the power of function calling, which is a very important tool, by turning natural language prompts into appropriately identifying what third-party Coda integration you should use.

Starting point is 00:28:32 So for example, a user types in, how many unread emails do I have? And it should appropriately pull out the Gmail Pack or Gmail third-party integration that Coda had. At that hackathon, I was playing around with embeddings from Pinecone to see, can I more accurately pull out the right third-party integration? So that was one way through internal hackathons, but there were also external hackathons. I remember in SF there, when Llama 3 first came out,

Starting point is 00:28:58 they were hosting a fine-tuning hackathon. So I went. At the beginning, they tell you what is fine-tuning, how to use it, which is great. Then there are a bunch of startups there that are building fine-tuning platforms. So they give you free credits to go fine-tune. And so then I remember building on top of Replicate and fine-tuning Llama to produce Coda formulas, which is our equivalent of Excel formulas.
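Going back to the internal hackathon experiment above, matching a prompt like the unread-email question to the right integration can be sketched as a similarity search. This is only an illustration under stated assumptions: the integration catalogue is made up, and `embed` is a toy word-count stand-in for a real embedding model or vector database, not how the Coda team built it.

```python
import math
from collections import Counter

# Hypothetical integration catalogue; names and descriptions are invented.
INTEGRATIONS = {
    "gmail": "email inbox unread messages mail",
    "calendar": "meetings events schedule calendar",
    "salesforce": "sales accounts revenue pipeline deals",
}


def embed(text: str) -> Counter:
    # Toy stand-in for an embedding model: a bag of lowercase words.
    return Counter(text.lower().replace("?", "").split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[word] * b[word] for word in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def pick_integration(query: str) -> str:
    # Return the integration whose description is most similar to the query.
    q = embed(query)
    return max(INTEGRATIONS, key=lambda name: cosine(q, embed(INTEGRATIONS[name])))


print(pick_integration("How many unread emails do I have?"))
```

With real embeddings the match would not depend on exact word overlap, which is presumably why embeddings were worth trying for this task.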

Starting point is 00:29:24 So learning by doing to me was the most effective way when things are changing so quickly. And even though hackathons are the most effective, you know, reading blogs, some papers, Twitter to see what other people are doing, did help. There are a lot of open source companies. I remember back in the day, LangChain had lovely documentation on how to do RAG when it was first getting popularized. And so reading what other people are doing, even though they're informally written, it's not a textbook, it's not a course, has been very informative as well. Nice. Well, yeah, I guess this is so new. It's up to you to figure out what works for you and just try a bunch of stuff and see what sticks. And also it changes, right? So like,

Starting point is 00:30:01 whatever works now, it might not be as efficient later. So totally. Yeah. And there are books coming out. I remember you interviewed Chip and she has a lovely book on how to build as an AI engineer. Yeah, yeah. She actually captured a lot of the things that are not really changing anymore. So that's also changing. And I think, you know, we'll now see courses come out. And Andrej Karpathy is doing some really, really in-depth courses if you have the time, which honestly, it doesn't sound like a bad time investment to do so.

Starting point is 00:30:29 Yeah. Yeah, exactly. With Zero to Hero. Yeah. So at Coda, what was your favorite project that you built using AI tools? Or your favorite AI product? A project that's very close to my heart from Coda is Workspace Q&A. So maybe to set some context, at Coda, a very common customer complaint was that I have so

Starting point is 00:30:47 many documents with my internal know-how of company-meant-needed documentation, but it's hard to find that document when I need it. And in about November 2023, RAG was getting popularized, retrieval augmented generation, and it struck our team that we actually had all the tools in place to build a chatbot that would solve this problem. First, we had a team that had just redone our search index and they put a lot of hard work into redoing that search index. Second, we had the infrastructure in place to call LLM tools reliably. And third, we had a chatbot that allowed you to, in your Coda doc, chat with an LLM. With those three things, I was able to just glue them together in a

Starting point is 00:31:31 couple days and build a version one of a chatbot that lets users ask questions about the content of their workspace. Oh, nice. So I put that, you know, on Slack with a Loom. And to my surprise, our CEO Shishir started taking interest in this and responding to that thread. He saw a grander vision where Coda could create an enterprise search tool. So it's not just searching documents, but all third-party integrations, which Coda had a lot of. So ideally, you know, a sales team should be able to come in and say, what's my projected ARR for an account? And it pulls in from your Salesforce integration and answers that question for you.
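The three pieces glued together above (a search index, a way to call the LLM, and a chat surface) follow the usual retrieval augmented generation shape. A minimal sketch of the first two steps is below, under loud assumptions: the documents are invented, `retrieve` is a toy keyword-overlap search standing in for a real search index, and the final LLM call is left out entirely.

```python
from typing import Dict, List

Doc = Dict[str, str]

# Hypothetical workspace content, invented for illustration.
WORKSPACE_DOCS: List[Doc] = [
    {"title": "Vacation policy", "text": "Employees request vacation days through the HR portal"},
    {"title": "Deploy guide", "text": "Run the deploy script after tests pass"},
    {"title": "Expense policy", "text": "Submit expense reports within 30 days"},
]


def tokenize(text: str) -> set:
    return set(text.lower().replace("?", "").split())


def retrieve(query: str, docs: List[Doc], k: int = 2) -> List[Doc]:
    # Toy stand-in for a search index: rank by shared words with the query.
    q = tokenize(query)
    ranked = sorted(docs, key=lambda d: len(q & tokenize(d["text"])), reverse=True)
    return ranked[:k]


def build_prompt(query: str, passages: List[Doc]) -> str:
    # The retrieved passages become context the model is asked to rely on.
    context = "\n".join(f"[{p['title']}] {p['text']}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


question = "How do I request vacation days?"
prompt = build_prompt(question, retrieve(question, WORKSPACE_DOCS, k=1))
print(prompt)
```

The resulting prompt would then be sent to the model; the quality of the answer depends heavily on the retrieval step, which fits the emphasis on the reworked search index.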

Starting point is 00:32:10 So that was exciting. And he basically tasked a couple of us to experimentally prove out that Coda could build this in four weeks. Oh, nice. And a good challenge. Yeah, it was a good daunting challenge. It was me, my manager, the CTO, a designer, and a PM. And it was short

Starting point is 00:32:30 deadlines and high stakes because this was going to be demoed to important people. So it was very much all hands on deck. On one day's notice, we flew to New York to hack together. And it was nights, weekends, whatever it took to make this work. It was a very exciting time and I think a lot of blood, sweat, and tears behind it. But the TL;DR is that it did go very well. And it became the birth of a second product for Coda called Coda Brain. From January to June of 2024, we had a much larger initiative where 20 people were now working on it.

Starting point is 00:33:10 And it was taking that version two that small team we had built and making it a more robust thing, which is a very hard challenge in and of itself. And the cherry on top was that Coda Brain was presented at Snowflake Dev Day at the keynote. So it was just a very exciting time to be a part of it from day one and the world getting to see it at a large scale. Yeah. So I'm just like taking notes on like how amazing it is that, you know, you joined Coda as a new grad with like no experience in AI engineering. And I'll just frankly, you know, you had less experience than like a lot of the experienced engineers in software engineering. I mean, just the years of experience. But from the first day, like you just kept track of the industry. You saw this exciting thing is coming out. ChatGPT, you tried it out.

Starting point is 00:33:57 You were convinced this is going to be interesting and fun. You asked your manager when Coda started a team to join. They said no, and you just went and learned. And in a matter of a few months, you probably leapfrogged a lot of the people who were just kind of waiting or, you know, not necessarily like being as active as you are. You got onto this team as an early engineer. And, you know, a year later now when 20 people were working on this with Coda, you were still like one of the earlier ones. So it just like shows me how like what you were saying, not waiting for permission really pays off and you can just do things, you can learn things. And especially for

Starting point is 00:34:36 an innovative technology like AI and whatever we see next, it's actually valuable. Like a company will value this kind of thing because it helps them and they want, they desperately need people like you were in this case or other folks who are doing similar things. What is really cool is that it's so new. So it definitely levels the playing field between all sorts of seniorities because nobody knows what the right way is. And so we're all just figuring it out together. And that's what makes it definitely more exciting. Yeah, and I feel there's like two things here.

Starting point is 00:35:06 Like if you're someone who already has some experience, may that be one year or 10 years or 20 years, that experience will eventually be applicable. Once you understand how this works, you know, you can take that past experience and see how it applies. And if you don't have experience, it's actually not a bad thing because you're coming in with a fresh mind and you will probably be, you will not have some of those biases of,

Starting point is 00:35:27 you know, for example, a lot of software engineers who have like 10 plus years of experience, they will know that in a well-built production system, unit testing and automated testing is super efficient and a very good way to do stuff. Now, with AI systems, it's not necessarily the case when they're nondeterministic and, for large-scale systems, things like monitoring or checking evals might be a better way.

Starting point is 00:35:48 I'm not sure which one it is, but not having that bias could actually speed you up. So either way, there doesn't seem to be any downside in just figuring it out and mastering this tool, because it is a tool at the end of the day. Yeah, just, it's a new tool in our, it's honestly a magical superpower, because now it just unlocks so many things that you can do on top of it. Yeah, but it feels a bit like, you know, the Harry Potter one.

Starting point is 00:36:11 Like, when you watch the movies, like, you know, at first it sounds magical when you read the book, like you can do all these spells. But if you're a hardcore Harry Potter fan, you will know that there's only certain spells that you can do and, you know, there's a certain thing that you need to say. And so there's a whole mechanic around it. And like, for every fantasy book as well, you know, when there's a magical world. Like, there are the rules and there's people who can master those rules. And I feel it's a bit the same. At first it's magic, but actually it has

Starting point is 00:36:36 the rules. And once you learn it, you can, you can be this, you know, sorcerer who can. Yeah, exactly. This episode is brought to you by cortex.io. Still tracking your services and production readiness in a spreadsheet? Real microservices named after TV show characters? You aren't alone. Being woken up at 3am for an incident and trying to figure out who owns what service, that's no fun. Cortex is the internal developer portal that solves service ownership and accelerates the path to engineering excellence. Within minutes, determine who owns each service with Cortex's AI service ownership model, even across thousands of repositories. Clear ownership means faster migrations,

Starting point is 00:37:15 quicker resolutions to critical issues like Log4j, and fewer ad hoc pings during incidents. Cortex is trusted by leading engineering organizations like Affirm, TripAdvisor, Grammarly, and SoFi. Solve service ownership and unlock your team's full potential with Cortex. Visit cortex.io slash pragmatic to learn more. That is C-O-R-T-E-X dot I-O slash pragmatic. So then you had a really great run at Coda, and then you did something like you decided to look around on the market, and you blogged about this. You interviewed at 46 companies.

Starting point is 00:37:52 Did I get that right? Yes, but there's context behind that. I'd love to understand, like, how you went about interviewing, especially specifically for an AI position. What did you learn about what the market is like, what interviewing is like, what the whole scene is? And if you can give a little context on like where you did this in terms of location-wise, types of companies just to help, you know, us all understand this. Sure. Maybe just by giving a little bit of context, it was over a six-month period. And the first half, I wasn't closing them.

Starting point is 00:38:23 I was getting no's as I was getting ramped up on my LeetCode and system design prep. After that, the interview process did take longer than I expected, though, because the AI space is especially noisy right now. And when I was trying to do my due diligence, like we were talking about earlier, there were often open questions that made me feel uneasy about the growth potential. And the advice I got from a mentor was that if it's not heck yes, and if you have savings, don't join. It's not fair to the company or you. So that was how I thought about this. In terms of the space, it was clear that there are the product companies, infrastructure companies, and the model companies.

Starting point is 00:39:00 I found it helpful to put companies in each category and figure out which segment you're most excited about to help narrow down the options, given that there's so many AI companies right now. Could you give just an example of each, especially with the infra and model? I think it might be a bit. I'm interested in how you're thinking about that. Yeah. Product companies are the companies building on top of the model. Here I think of Cursor, Codeium, Hebbia. Infrastructure companies are the companies building the tools to help AI product companies effectively use LLMs. So whole suite of these.

Starting point is 00:39:34 They're the inference providers like Modal, Fireworks, Together, vector database companies like Pinecone, ChromaDB, Weaviate, eval and observability tools like Braintrust, Arize, Galileo, and a whole other suite of products. And then there's the model companies, which are the base of the ecosystem building the intelligence. You have the big tech companies like Google, Meta, building models. And then you also have startups like, or not startups. You have other big smaller companies. You should be startups. Like OpenAI, Anthropic building models as well. I think it's a really good way to think about it. And again, I don't think many of us have verbalized it like this or this also goes back to not many people

Starting point is 00:40:15 have necessarily gone through. I will say this is not something that I came up with myself. Yash Kumar, a mentor, he pointed out that you should look at the space like this. And that's how I think about it now. Wonderful. And what did you learn about, like, each of these companies in terms of the interview process, what the vibe was like, generally, and also, like, how you personally felt about it. Because, like, as I understand where you were, at Coda, we can put them in the product

Starting point is 00:40:40 category. Sorry, the product company category. So for me, in trying to be more focused in my search, I decided to focus on model and infrastructure companies because I wanted to keep getting breadth in what I was doing, and I felt like the product companies were too similar to my experience at Coda, which was phenomenal, but I wanted to keep growing. And that definitely, the tradeoff was that it was a bit more of an uphill battle because the work that I had done was not as relevant to model or infrastructure companies. In terms of the vibe, I think all of them are shipping really fast,

Starting point is 00:41:11 have really lean teams and are out to win. So it's a very exciting time to be looking at them. Questions I would ask myself when I was trying to figure out, is this company viable in the long run. On the infrastructure side was, are their margins high enough given that so many of these inference providers are also paying for expensive GPUs? So what are the margins here, especially when a good software business should have about 70% gross margins? And how easy is it to build infrastructure in-house? You know, we know this, but engineers are a hard group of people to sell to because if it's not complex enough, if it's too expensive or doesn't work exactly how they want, engineers will just build it in-house. Google is a phenomenal example that's built so much in-house.

Starting point is 00:41:56 So that's how I was thinking about the infrastructure companies. In terms of the model companies, I was just trying to get a sense of if they're training frontier models, can they afford to keep training them, given how expensive it is? Are they staying ahead of the open source competition? Because if there are open weights that exist for a model, no one's going to want to pay a premium to get the model from a closed source provider. It's the sad reality. It is. And I think that it's interesting because today product companies are still willing to pay a premium for the best model, even though an open weight exists, as long as the closed source provider is ahead. Yes. And anyone who's now nodding along will find themselves evaluating an offer or company and trying to understand the margins, that's a hard one to do, especially as an engineer. Yeah, exactly. Where did you get like data or did the companies answer some of your questions on the unit economics?

Starting point is 00:42:51 These are things that companies like to keep under wraps, even as someone who's covering sometimes these companies or just interested in the space, even publications, like financial publications, you know, will just kind of wave their hands because it is hard. Yeah, like this is the big question. And these companies, they want to hide these things from the casual observer for sure. Exactly. I think it's totally fair for a company not to share this information until you get an offer because it is sensitive information. I do think once you have an offer, it would be irresponsible for them not to tell you when you are an investor as well. And you sign an NDA. You keep it to yourself. So I do think they should tell you.

Starting point is 00:43:28 For questions or for companies in the inference space, I would just ask, you know, how much money do you spend on the GPUs? And then how much revenue do you have, to make rough back-of-the-envelope math of what those margins are like to just get some sense of the business. And then I also found it helpful to read some news providers like The Information that do very good diligence on the economics behind different startups in the AI space. And if I could, I would try to also ask investors who have invested in these companies or passed on investing in these companies because they see the deck come to them. So they have a lot more insight onto what the business really looks like. You're talking like an investor or like how a senior executive would do it, which I love.
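The back-of-the-envelope math described above (GPU spend against revenue, compared with the roughly 70% gross margin cited earlier for a good software business) can be written down in a few lines. The figures below are invented for illustration and do not describe any real company; treating GPU spend as the entire cost of revenue is also a simplifying assumption.

```python
SOFTWARE_BENCHMARK = 0.70  # the rough "good software business" figure from the conversation


def gross_margin(revenue_usd: float, cost_of_revenue_usd: float) -> float:
    # Gross margin = (revenue - cost of revenue) / revenue.
    if revenue_usd <= 0:
        raise ValueError("revenue must be positive")
    return (revenue_usd - cost_of_revenue_usd) / revenue_usd


def meets_benchmark(margin: float, benchmark: float = SOFTWARE_BENCHMARK) -> bool:
    return margin >= benchmark


# Hypothetical inference provider: $10M revenue, $6M spent on GPUs.
margin = gross_margin(10_000_000, 6_000_000)
print(f"gross margin: {margin:.0%}, meets benchmark: {meets_benchmark(margin)}")
```

A result well under the benchmark would not settle the question by itself, but it gives a starting point for the follow-up questions about where margins are expected to go.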

Starting point is 00:44:18 I think more people should be doing this, by the way, and not enough people are doing it. So it's just very refreshing to hear. And by the way, like the investor part is interesting, because in my experience, investors, when you are applying to a company they've invested in, they actually want to help close great people. Yes, exactly.

Starting point is 00:44:35 They will happily connect. And then you also have a connection where a few years down the road, that investor might reach out saying, oh, I remember, you're a business-minded engineer. You know, like in the future, it's hard to tell. I think we were talking about this before, what will be in the future, but there will be only more demand for software engineers who not only know how to code,

Starting point is 00:44:54 but are curious about the business, can communicate with users, etc. So you'll now have a network, a stronger network. So there's only upside in doing your due diligence. It can actually help your career. That's true. And I 100% agree with investors being very generous with their time in wanting to chat with you and explain to you how the business works. So that's been something that's been really fun to do for sure.

Starting point is 00:45:15 And then just going back to like, this is all great when you get an offer, but how did you get to getting an offer? Like what did you need to brush up on in terms of interviews? Was it the pretty typical, you know, tech interviews, even though these were for AI engineering roles, of LeetCode and system design, or were there some AI-specific things? You know, what helped you go from initially you stumbled and you didn't get too much to like, okay, you actually were getting offers now? In terms of the interview process, I definitely thought it was all over the place, as the market is trying to move away from LeetCode but still asks LeetCode. So then you end up having to study LeetCode as well, unless you know exactly where you're applying to.

Starting point is 00:45:55 So there were coding interviews, system design, and then projects. Coding was a mix of data structures and algorithms where the best way to do it is LeetCode. Luckily, NeetCode with an N now exists and he has phenomenal videos. So that was great. I believe in doing spaced repetition. So doing those questions a lot of times. Then there were front-end questions because I'm a full-stack engineer as well. And I found that there was this resource, GreatFrontEnd,

Starting point is 00:46:21 that had lovely interview questions for the obscure JavaScript questions they sometimes ask. On the back end, that one I just more relied on things that I had done at work for those interviews. That's the coding part. The system design part, I thought Alex Xu's system design, his two books, phenomenal, just reading those, really understanding them, doing them again and again, until you understand why things work a certain way. Honestly, I love system design interviews. They're really fun because you learn things outside of the domain that you're in as well.

Starting point is 00:46:50 And then there's the third type of interviews, which is project interviews where you go build something in a day. And those are probably my favorite out of all of them because you get to show how passionate you are about that specific product and you can actually show what you can do. I do hope that as an industry, we move away from LeetCode and instead move to just project interviews, reading code, which has become way more important today, as well as debugging code. But I think we're kind of in the interim where as an industry, we haven't fully formed an opinion here.

Starting point is 00:47:18 And then most of these interviews were at the end of last year, so in the end of 2024 or so. Were they remote or were some more in person already? It was from about June of last year, and a large chunk were remote, but there were definitely interviews in person as well, which I enjoyed because I was very much optimizing for companies that are in person. Yeah, we'll see. But I think we're sensing a trend or I'm sensing a trend that in-person interviews might be starting to go back, at least your final rounds, which, by the way, it might not be

Starting point is 00:47:51 a bad thing. I mean, it's interesting because before COVID, like when, you know, I spent like most of my career there, it was just in person. And there are so many upsides, right? You do meet the people. You do see the location. Oftentimes you meet your future teammates. And for example, for me, I once in London, I had

Starting point is 00:48:09 two offers from two banks. And in one case, I met my future team, the whole team. And in the other, I didn't meet my future team, it was just like they said, like, you will be assigned a team. And I actually chose, it was a lower salary, but I chose the lower salary because I really liked the people. And, you know, like we just kicked it off. It felt like a good connection. And back then I went through a recruiter. So the recruiter negotiated the same salary for me, which was kind of a win, I guess.

Starting point is 00:48:34 But like, there are like, I know there's, you know, like, we will always hear people like mourning the end of or fewer remote interviews, but there are all these upsides, which when you're committing to a place for so many, for hopefully many years, you want to have all that information. 100%. Definitely. I think it's energizing on both ends for sure. It's a great point.

Starting point is 00:48:56 And so in the end, you joined OpenAI, right? Yes, I did. Congratulations. Thank you. And then can you share on what kind of general work you do at OpenAI? Sure. So I work on safety as an engineer at OpenAI. And OpenAI's goal and mission is to build AGI that benefits all of humanity.

Starting point is 00:49:14 On safety, we focus on the suffix of that statement, so benefiting all of humanity. Some things I work on are small, low-latency classifiers that detect when the model or users are doing things that are harmful so that you can block live. So that means the training, the data flywheel, hosting these models at scale. Second thing that I get to work on is measuring when the models are being harmful in the wild. And there are a lot of dual use cases over here, but really trying to get a sense as these models become more capable and people are figuring out different ways to jailbreak them and exploit them, what are those unknown harms that we don't know of with more powerful models and then

Starting point is 00:50:00 distilling it into small classifiers. There's also on my team a lot of safety mitigation services that we own. And so part of our work is to integrate it with all the different product launches. And as you know, there are a lot of different product launches. That definitely keeps our team busy. And that's just the tip of the surface. There are a lot more things that we work on in safety. I mean, this is like, it sounds very interesting because when I worked on payments back at Uber,

Starting point is 00:50:23 we had a team called fraud. And, oh, boy, they had so many stories. Like, I just talking with them, I, like, you would think, you know, like payments is pretty simple, like, oh, you need to pay. But then the edge cases are always interesting with every area. And the same thing, I guess, with LLMs. I mean, LLMs are not as simple, but once you realize how they work, next token prediction, it sounds pretty simple. But then the edge cases and all the things that could go wrong,

Starting point is 00:50:46 et cetera, and it sounds like you're kind of in the middle of that, like having a very, like, good vantage point in, actually, in the details. You've now worked at Coda. You've interned at Google and Microsoft and you talk with mentors about like what other places are. What are things that you feel that are just very kind of distinctly different about OpenAI compared to other companies? I think what makes OpenAI unique is the mix of speed and building for scale. You know, at startups, you get that speed of iteration and it's so fun.

Starting point is 00:51:15 And then at bigger places, you get to build for scale. But OpenAI is in a very unique spot where you have both at the moment. Things move really fast and you have huge amounts of users. The service that I work on, you know, gets 60K requests per second. And you just think normally you get one or the other. And it's really fun to get both. Second thing that I think is quite unique for a company of this size is the open culture. People are very open to answering questions on why and how things work a certain way.

Starting point is 00:51:51 So it's a great place to learn, which is something I didn't realize from the outside. And then third, people are just very passionate about the mission, work really hard. And I don't think this is unique to OpenAI in and of itself. All companies, I think, where great work is happening, people are like this, but it's just never a boring day in the office because people care so much and are constantly shipping. Yeah, and then talk about shipping. Like you've, I'm assuming you shipped some things to production already, but how can we imagine a thing, a project, an idea making it into production, right? Like there are very bureaucratic companies, you know, I don't want to like

Starting point is 00:52:27 say old Microsoft, maybe not today, but where there's like, you know, like very strict planning process, then Jira tickets are created by the PM, the engineers have to pick it up, then someone else might actually deploy it. So this is the super old school and slow and the reason why some engineers don't like it. What is it like, you mentioned it's fast, but what was your experience

Starting point is 00:52:49 in getting things from ideas to production? And is it multiple teams? Can one person actually do it? Is it even allowed? I don't know. I think it's very much allowed and very much encouraged. There have been publications on how Deep Research came to be,

Starting point is 00:53:06 where it was an engineer hacking on something, presenting it to larger C-suite and now becoming a full, very impactful product. So it's definitely encouraged, which I love. I, too, have had a similar experience, and it's very encouraged to come with ideas and actually drive them forward. Just strictly from your perspective,

Starting point is 00:53:24 what do you think, like, one thing that stands out that Open AI can actually still ship so fast? Because it feels it defies a little bit the laws of growing organizations. which eventually slows down at one point, I'm sure it will, but there's no signs of this happening so far. My observation is that the systems are built to enable you to ship fast and they give engineers a lot of trust, even though it comes with the downside of sometimes that can lead to outages. To put this very concretely, when I joined, you can make static changes without an approval.

Starting point is 00:53:59 So you have trust to go in and flip a flag to turn something on. That's no longer the case. You need one reviewer. The service that I get to work on has 60,000 requests per second, but you get to deploy with one review immediately. So my observation is that there is truly trust put in engineers to work quickly and not have a lot of red tape around shipping fast. Yeah, and I think this just goes with kind of an unsaid expectation. That expectation will be very high of the people who come in here because you cannot hire an engineer who is used to, you know, only doing a small part, not used to thinking about the product and the business impact and all those things. So I have a sense that what you're doing, it might be kind of a given for you, but in the industry,

Starting point is 00:54:42 it might be more common to expect that engineers are just wearing, you know, we used to call it wearing more hats, but it's just like, it's just how it is. Like you do want to have a, you know, like you're kind of merging a little bit of PM, a data scientist, an engineer all in one. And these are the type of people who can actually make something like OpenAI or similar companies, like work so well with this many people. Yeah, and I just think with intelligence today, the roles between data science, engineering, back-end, front-end, PM, blur so much that each individual,

Starting point is 00:55:18 whether you're at OpenAI or not, is expected to do more of that because you can get help from a very capable model. And I think that makes it very exciting for us, because it means that we can truly be full-stack engineers and go from an idea to launch very quickly. Absolutely. So what are some things that you've learned about AI engineering, the kind of the realities of it? Because it's a very new field. And like, what are some surprising things that you didn't quite expect? One thing that I've learned that I didn't realize coming in was how much of AI engineering is about building solutions to known limitations of the model. And then as the model gets better, you scrap that work and build new guardrails. Let me give you an example from Coda. So pre-function calling days, we wanted a way to get our model to take action based on what the user said.

Starting point is 00:56:10 Function calling didn't exist, so we prompted the model to return JSON, parse that, and actually deterministically call an action based on that JSON blob. Then OpenAI released function calling. Okay, scrap that and instead integrate with function calling. But, you know, back in those days, function calling was not very reliable. And now today we moved from function calling to the MCP paradigm. So things are changing very quickly and the models are getting better, but they're still not perfect. The moment you get more capability, there are more engineering guardrails you need to build to make sure they work reliably at scale.
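The pre-function-calling workaround described above (prompt the model for JSON, parse it, then deterministically call an action) might look roughly like the sketch below. It is an illustration under assumptions, not Coda's implementation: the `action` and `args` field names, the `ACTIONS` registry, and `count_unread_emails` are all hypothetical, and the model output is a hard-coded string rather than a real response.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical inbox data and action, invented for illustration.
_FAKE_INBOX = {"inbox": ["a", "b", "c"], "spam": ["x"]}


def count_unread_emails(label: str = "inbox") -> int:
    return len(_FAKE_INBOX.get(label, []))


# Only actions registered here can ever be executed.
ACTIONS: Dict[str, Callable[..., Any]] = {"count_unread_emails": count_unread_emails}


def dispatch(model_output: str) -> Dict[str, Any]:
    # Guardrail 1: the model may not return valid JSON at all.
    try:
        payload = json.loads(model_output)
    except json.JSONDecodeError:
        return {"ok": False, "error": "model did not return valid JSON"}
    if not isinstance(payload, dict):
        return {"ok": False, "error": "expected a JSON object"}

    # Guardrail 2: the model may name an action that does not exist.
    handler = ACTIONS.get(payload.get("action"))
    if handler is None:
        return {"ok": False, "error": f"unknown action: {payload.get('action')}"}

    # Guardrail 3: the arguments may not match the action's signature.
    args = payload.get("args", {})
    if not isinstance(args, dict):
        return {"ok": False, "error": "args must be a JSON object"}
    try:
        return {"ok": True, "result": handler(**args)}
    except TypeError as exc:
        return {"ok": False, "error": f"bad arguments: {exc}"}


print(dispatch('{"action": "count_unread_emails", "args": {"label": "inbox"}}'))
print(dispatch("Sure! You have 3 unread emails."))
```

Each guardrail exists because the model's output is not guaranteed to be well formed, which is the kind of scaffolding that gets scrapped once a more reliable mechanism arrives.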

Starting point is 00:56:52 Yeah, and I guess you need to become comfortable with throwing away your work when the model is there. You just need to not be as attached to it because I think there's a little bit of this, especially when you're used to like things not changing as much in software engineering. So just, you know, like it's not a waste, it's a learning. Yeah. And it's just been easier now, or cheaper, to produce code. And so you see this expansion and collapse phase happen a lot where you build a lot of features, see what works and then collapse to what works and restart. There's a lot of scrapping your work as the creation of code becomes cheaper.

Starting point is 00:57:21 It's easier not to be attached when an LLM also helped generate that code. Yeah. I think this will be a big change, a good change. Once we get used to it. Yes, exactly. Now, when it comes to AI and junior engineers, like, you're such an interesting example in the sense that you started your career a little bit before AI took off, but you also transitioned with like not decades of experience just yet. What is your take on how GenAI will impact new grads, people who are still in college? Because, you know, there's two takes and they're both very extreme.

Starting point is 00:57:57 One is the engineers with 10 plus years experience often just feel like, oh, I feel so sorry for these people, like they're not going to get jobs. Even if they get jobs, they're going to depend on AI. They're not going to read the books. They won't know what it was back in our day. Right? So there's this thing. And also, like, some people are generally worried that, well, you know, you can now

Starting point is 00:58:19 outsource so many things. They're thinking, okay, maybe they can pick up things really quickly, but maybe they're never going to get to that depth. Now, I think they both are extreme. I'd love to hear, like, how you see it because you're kind of seeing this firsthand. Definitely. And you're right. From my experience level, I get insight into what both of those engineering lives are like. And currently, I'm not convinced that AI is going to be disproportionately worse for junior engineers. In fact, I think that it allows everyone to move higher into the stack and be more creative in what you're building. You empower younger engineers to just do more, propose ideas and actually ship that. I do subscribe to the take that there will be people that use AI to learn and then people that use AI to avoid learning. I would say that there's actually room for both things to exist and

Starting point is 00:59:14 you should be doing both. I personally think that when you're working on a greenfield project, trying to prove a vision that something should exist, why not skip the learning, vibe code it to actually get a real product that you can validate, and then go build this for real as a new product line. But I don't think you should skip the learning when you're trying to now build a robust system that you are the owner of because when shit hits the fan and you're in a SEV, AI doesn't help that much because it doesn't work very well in between at a high systems level and then, you know, reading logs. So when you own the code, I think you should use AI to learn to understand all the edge cases of why things work a certain way. So I think there's room

Starting point is 00:59:57 for both. It's going to be an important skill for us to learn when we should outsource the doing versus when we should use AI to make ourselves better and stronger engineers. Yeah, and I guess, like, there's probably not too much harm. And if you don't understand it, spend some time to understand it. And AI will help you typically do this faster. So, like, I'm not sure if this is to do with personality or curiosity, but we've seen this before, by the way. Like, at any time, but let's say 10 years before when I was like maybe a mid-level engineer, like I saw new grads join the workforce and we were now using, you know, higher-level languages like JavaScript or Python or TypeScript.

Starting point is 01:00:40 Or take the example of the recent, you know, a few years ago, like new grad engineers, they start with React. And when you start with React, JavaScript and TypeScript, a lot of people who haven't studied computer science and didn't do assembly or C, let's say, or these kind of things, you can just learn React and you can just stay there and you can figure out how to use it. But the better developers have always asked, why does it work like this? What happens? What is a virtual DOM?

Starting point is 01:01:08 How can I do? How can they manipulate it? And you look at the source code. And I feel there's always been the people who do this and they're just better engineers eventually. They can debug faster. They ask why. And, you know, they're so. So I think in this new world, we will just have this.

Starting point is 01:01:23 And I don't think this trait will die out. And, you know, like, to me, you're a great proof of, you know, you go deep. You understand how the things work. And then you think you decide like, okay, I'm going to use it to my advantage right now. I just want to go fast because I know what I'm doing already. Yes. I do think that's spot on and that we've had this in the past. It will become even easier to outsource that intelligence and in some sense be lazy.

Starting point is 01:01:48 So I think we'll have to just be more intentional as engineers to make sure that we are actually going deep in cases where it really matters. And so far from what you're saying, how, because you've seen before GenAI tools, you've been a product engineer with AI. You're now working at a model company. How do you think these tools will change the software engineering that you have been doing before? And how is it already changing your day-to-day work? In terms of what doesn't change, we still have code as the way innovation manifests. You go from idea to code to iterate. That's the same. As engineers, you still need to know how high-level systems work and design them very well. You have to debug code really well, and you have to be able to be really good at reading code.

Starting point is 01:02:35 So to me, that all stays the same. What's changed, I think, is the division of responsibilities between PM, designer, software engineer. I was talking to a friend at Decagon. I think they were telling me there are 100 people and they still don't have a designer because product is just expected to do the design as well. As a software engineer, this has always been true at startups, but now more than ever you're expected to do product work as well. We talked about this earlier. What also changed is that software engineers become more full stack.

Starting point is 01:03:03 You don't outsource work to another adjacent role like data engineer. You're expected to build those data pipelines yourself. I also think what's changed is that we need to be better at articulating our software engineering architectures and thoughts because you are expected to prompt models to do this. And the engineers that will be most efficient are the ones that can see the big picture, write a great prompt that also catches the edge cases, and then have the model implement it.

Starting point is 01:03:35 It's like the best engineering managers that are able to zoom in and zoom out really well and being able to zoom out, prompt what you need to do. But then zoom in when actually reading that code and catch potential bugs instead of just relying on the LLM to be 100% right in all cases because there'll be unique edge cases to the system that you're building that the LLM is not aware of and you need to be able to catch that when you're reading the code.

Starting point is 01:03:58 Yeah, I feel like if you have a mental model, and I see this so much when I'm using these tools, you know, when I'm either like vibe coding or prompting or when I know what I want to do, when it's in my head, either like because I know my code base or I know what I want to do or I just sat down and I thought through it and I drew it out. I'm so fast. I'm great. Like, and I can switch between, you know, I might do an agentic mode to like generate it. Maybe I like it.

Starting point is 01:04:23 Maybe I don't. Then I just do it by hand. It doesn't matter. Like, I get there. Like, I know where I'm going. But when I don't, I did this where like, oh, I tried to vibe code a game. And I failed not because I don't, I just didn't know what I wanted to do. Like.

Starting point is 01:04:36 Yeah. And, and, you know, when your prompt doesn't tell like, oh, do this, then I don't give it guidance. Yeah. Like it. Yeah. No, definitely. It wasn't the fault of, of the tool. It was just, you know, what I.

Starting point is 01:04:48 I don't know what I expect. Like, how would this thing know, which it's non-deterministic, but you need to give it some direction. Exactly, for sure. And I also, on that point, think that it's great when you're zero-shotting and doing greenfield work. But today, and I think this will change, it's not the best at working in large code bases. And as engineers, you're always working in large code bases when you're building things for prod. And so that part of our job hasn't changed of being able to find the right place the code should go, use the right modules that exist and piece it together

Starting point is 01:05:25 in a larger code base when you're adding a feature. Yeah, and also just like simple stuff, which we take for granted, but like setting up the tools to like run the tests to know how to deploy, to know how to control the feature flags, to how safely put out something so it doesn't go to prod if you want an AB test, like, which, you know, when you're onboard, this is a pretty kind of given, but if you work at a place that has like microservices or whatever, like it's all. And I feel like there's so many other things. But I love how you summarize what will not change,

Starting point is 01:05:56 because I think that is really important. And I love how you brought up software architecture. I've been thinking about this recently. In fact, I've started to read some like really old software architecture books. Because there are some ideas that I'm not sure will change. I want to say this theory, but it might not change as much. What software architecture books are you reading at the moment? I've been going through The Mythical Man-Month.

Starting point is 01:06:17 I've almost finished this. This is a real one. And then I have, so this is from the 90s. It's called Software Architecture, and it's by Mary Shaw and David Garlan. And Grady Booch, who's a legend in software engineering, and I interviewed him. He said that he thinks this is the single best software architecture book.

Starting point is 01:06:38 Now, it's very thin, and it's, I think, from 1995 or so. So I've just started to read it, but I'm interested in what 1996, which was just 30 years ago, I'm just interested in what are the things that might have not changed. Clearly some things will be dated, right? Like they're talking about CORBA,

Starting point is 01:06:56 which is like this old Java framework that we don't use anymore, but some of the other things, there's a lot of reflection with civil engineering, and this book was written when there was no real software architecture, so they tried to define it, which I'm kind of thinking

Starting point is 01:07:11 there might be some interesting idea. So I'm interested, like, on what has not changed for the most part. Yes. No, that's a very nice approach of actually looking at the history to see what's not changed from then to now to then extend that line to what won't change in the future.

Starting point is 01:07:26 I'd be very curious to see what you learn from that. Well, and also reinventing, right? Because I feel we will have to reinvent some parts of the stack. And I think it's important to understand. Also, I feel the past like 10 years or so, we've not talked too much about software architecture. So maybe there is a little bit to learn from other people's ideas. So to wrap up, how about we just wrap up with some rapid questions?

Starting point is 01:07:45 I just ask a question. Okay, let's go for it. And I ask you, so first of all, what is your AI stack? For coding, Cursor. For hackathons, Deep Research to get a sense of what libraries already exist. ChatGPT is my default search and also my tutor, and some internal tools to quickly do RAG over company documentation when I'm trying to find something. So what is a book that you would recommend and why? Book I'd recommend is The Almanack of Naval Ravikant. I've read it a couple times.

Starting point is 01:08:12 Big fan of the way he talks about building your life, both from a very pragmatic way about how you should do it from your career, but also in terms of just how to be happy. And what is a piece of advice that made a big difference in your professional career? Don't wait for someone to give you the opportunity to go work on something. Go work on it. Love it. So, Janvi, this was so nice to have you on the show.

Starting point is 01:08:34 Thank you so much for having me in the first place. Thanks very much to Janvi for this conversation. To me, talking with her was a great reminder of how in a new field like GenAI, years of experience might be less relevant than teaching yourself how to use the new technologies like Janvi has done. It's also a good reminder of how it's never too late to get started. Janvi thought that she was late in 2022 because she was five years behind every AI researcher who's been using Transformers since it was released in 2017.

Starting point is 01:09:01 And yet, Janvi is now working at OpenAI, the company that arguably made the most in utilizing transformers and LLMs. For more in-depth deep dives on how OpenAI works, coming from the OpenAI team, and on practical guides on AI engineering, check out The Pragmatic Engineer deep dives, which are linked in the show notes below. If you enjoyed this podcast, please do subscribe on your favorite podcast platform and on YouTube. A special thank you if you leave a review, which greatly helps the podcast. Thanks, and see you in the next one.
