The AI Daily Brief: Artificial Intelligence News and Analysis - "How I Went from Non-Technical to Building an Andrew Tate LLM Chatbot in 3 Months"

Episode Date: May 12, 2023

Today in the first of a series on AI learning journeys, NLW is joined by Emmet Halm. At the beginning of 2023, Emmet was not a developer. A couple weeks ago, he released a chatbot that was tuned on th...e ever controversial Andrew Tate. Along the way, he also released a tool to help others fine tune LLMs on specific media creators. In this conversation, they talk about what it takes to start building in AI, and why there has never been a better time. Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI breakdown, the first and what I hope will be a series of interviews on AI learning journeys. Today I'm joined by Emmett Homm, who taught himself to code over the last three months and has just released not only a couple of celebrity chatbots, but also better tools to help others build something similar. The AI breakdown is a daily podcast and video show about the most important news and discussions in AI. If you're enjoying, please like, subscribe, and share, and here is my interview with Emmett. A couple weeks ago, I noticed this kid Emmett who was posting about a chatbot that he had created for Andrew Tate, obviously a very controversial figure, but a kind of cool, interesting use case for training a chatbot on someone's public media and then turning into something that other people could play with. But the most notable thing about Emmett was that he said that only a few months earlier, he was completely non-technical. He hadn't actually known how to code at all. So he went from not coding being non-technical to building an Andrew Tate and later a Chimoth Palahapitia chatbot in just about three or four months.
Starting point is 00:01:03 So I thought that journey was really interesting. I wanted to get into how he actually built that chatbot and what other people can learn. So this interview is about both what he built and how he learned to do it. All right, Emmett, welcome to the AI breakdown. How are you doing, sir? I'm doing well. Thanks very much. I was super excited to do this.
Starting point is 00:01:21 I think it'll be really fun as we were just talking about. You have been building in public, learning in public, and that's really what this new interview series is all about. So I think before we get into broad strokes, what you've been building, just who are you? What's your story? How did you find yourself coming into this space and where you are now? For sure, so that's the big question. But a little bit about me. I'm 21 years old. I grew up in the U.S. I was at Harvard for a year, studying history in Russian. And then COVID happened, big pivot, took a year off, really went down the entrepreneurship private hole. started my first company, which was an ed tech company. So we were like a mix between a startup
Starting point is 00:01:57 accelerator and a college admissions company. Worked on that for a little more than a year. And we really like our secret sauce was we had a really good TikTok account as a really good sales top of funnel. And so eventually that company was acquired by a larger ed tech company primarily for that weed acquisition strategy. And then was back at school first semester. Jaded been interested in crypto since about 2017. And this was back in like end of 2021. So really went down the Dow rabbit hole and thought that my time would be best spent building Dow tooling. And so ended up finding a co-founder, leaving Harvard again and raising and founding this company
Starting point is 00:02:31 called Dow HQ, which is like a Dow Marketplace and Aggregator for these online companies. We worked on that for a little more than a year and then ended up finding an aqua hire deal and winding that down. And so for me, it was after that, there was like a point of thinking, okay, like what do I want to do? I was previously fully non-technical. So in charge of everything from raising, organizing, just like putting people in the right place and getting us where we need to be. It felt like I was strong at that.
Starting point is 00:02:56 But really felt that like the one major thing that was holding me back or that would make me an even better founder in the future was just like having these base technical skills. And so really just started diving in, chatting with my developer friends. Some of our former employees were really smart and seeing, okay, how can I learn just from the basics? Like HTML, CSS, JavaScript, like very much from square one. and luckily, Chachi BT came out at about the exact time that I started going down this journey, so roughly three, four months ago. And so that's been incredible. And so I feel like I'm able to build 10 times faster than my technical understanding
Starting point is 00:03:29 lets me build. And so it's a bit of my cat and mouse of I build way ahead of my technical understanding, and then I have to catch up my technical understanding, and then I make another leap. And overall, it's been a lot of fun and a lot of opportunities have come just from really leaning into it, not being afraid to fail, trying to build as much as possible. ship in public and then just really listen to all the smart people online when they could decide my ideas and give feedback and how I can improve. Super interesting. First of all, history major into ed tech, into crypto. We could have a very long conversation because it's
Starting point is 00:03:58 basically a shared path. So I guess when it comes to AI, had you made the decision to start dabbling with it when you decided to be non-technical or did it just happen to coincide with you decided to make this decision to learn? And then it's the world's greatest learning tool they're waiting for you? I think more so the latter. So I started when chat GPD came out, I was still working on DowHQ. And so I was just using it for a lot of work tasks and then saw that our developers were using it religiously just to debug to build things really quickly. And so I found that very compelling because I often have lots of ideas and things I want to build. But the main bottleneck is finding someone to then build the thing or like using a no code
Starting point is 00:04:39 tool, which can often be quite limited. And so I just started playing around with it to build websites and then set up a list of incremental projects that I wanted to build the first just being like a very bare bones website and just see what I can build with it and my technical understanding and just push myself to build like more and more complex things over time. So I want to talk about some of the things you've built. The one that I think first captured people's attention was the Andrew Tate project or the Andrew Tate bot. But maybe that's wrong.
Starting point is 00:05:08 Was there something before that? Yeah, there were some things before that. But I think that's the first one that really took off. You wrote a thread about sort of the pieces of what it looked like to actually build that. Maybe we can do a video version of that almost, or at least walking through it. Yeah, absolutely. I just remember the one project that did take off before that was I built like an AI chef. So it was very simple prompt engineering, but you put in your ingredients and then it just output something you could make.
Starting point is 00:05:31 So that was like the first thing. But yeah, I built, it's called Top GPT. And I'll go ahead and share my screen. Let's go the entire screen. There we go. All right. This is top GBT, simple chat interface. Obviously, we have Andrew Tate plan words, top GBT.
Starting point is 00:05:47 This was just a shower idea that came to me. I was just thinking that the name was too funny not to do. Your Tate, very controversial, so high viral potential. So for me, I was like, okay, good opportunity to improve my chatbot infrastructure on the back end and then also high viral potential. And so for top GBT, you ask you questions, and it really just mimics the style and voice of Andrew Tate. And so the process by which you do that is called fine tuning.
Starting point is 00:06:10 And so fine-tuning involves, and we'll go into my code right here, involves a process of scraping data. So I scraped primarily from YouTube, cleaning that data. And then for the open AI, fine-tuning API endpoint, what you have to do is put it in this format called prompt completion pair. And so I'm going to dive into that in a minute. And that also relates to another project that I launched that hopefully lets people fine-tune these chatbots much faster. So this is actually a few of these are examples from the latest chat bot that I released last night, but the principle is the same. So the first thing's first, I made Python scripts to quickly scrape YouTube. And I use chat GPT to write a lot of this.
Starting point is 00:06:51 And then just debugged, work with developer friends to like really work out the kinks. And so after I got these scripts, I had things like this. So this is the Chamath Lex Freeman interview transcript. And so I have this huge transcript. And right now I need to figure out what to do to make it useful. So what I also did is I wrote a series of scripts that will take this transcript. The first thing it will do is it's going to pass this entire thing into essentially chat GPT using the chatGBT 3.5 turbo API or the 4 API.
Starting point is 00:07:24 It's going to clean it. And one of the really exciting things, I think it's applicable to a lot of people, is now a lot of data cleaning tasks that are slightly too complicated for that code to do, so for Python to do, but are tedious enough that you wouldn't want to do it and you really wouldn't want to give it to an intern are perfect for chat GPT to do programmatically through the API. So an example is when I'm cleaning this, not only do I want grammar to be fixed, remove words like I'm in, but I also want it to identify who the speakers are and identify context, which is more of like a general intelligence thing that you can't really code into Python, say, hey, infer who's
Starting point is 00:08:00 speaking, but chat GBT is quite good at that. And so, long story short, I pass, I got all these transcripts from a bunch of different videos. In this case, it was Chima. For Andrew Tate, it was just Andrew Tate videos. And then I pass that through, I clean it. So we get these clean transcripts. And then the next script that I pass it through is my prompt completion pariscript.
Starting point is 00:08:20 So what this does is it chunks the text. It passes it to the, and here the 3.5 turbo API. and the prompt is basically, hey, I have all this data, put it in JSONL format. Here's an example, right about 10 or 15 questions. And so this is currently, previously was the big bottleneck to fine-tuning a model. As often you have a big piece of text, but there's not a lot of natural questions in that conversation, maybe just a few. But in order to train it, you need a constant, like a very big document of question response. And the prompt completion script just generates a bunch of synthetic questions.
Starting point is 00:08:57 And then eventually you get something like this in Chamath.Jassonel, where you have this big document where you can see prompt. So it gives you the question and then the completion. And so the completion here is an actual response from, in this case, at Chamath interview transcript. And then most of the questions here are fully made up by OpenAI. So let me just ask you quickly. So is it working backwards from answers basically to create the question? Yeah, that's correct. I posted a version of this on GitHub, which the people have started. It's called Tune AI or Auto-Findune. And so this is that whole process I was walking through. So this is open source. Anyone can use it. You might have to tinker with it a little bit. But when it works, you can run it actually in one line just from your command terminal. And the idea here is like when I was building top GBT, I had to manually just copy and paste into Jad GBT. It took quite a long time because the quality of your data, directly influences the quality of your model. So that's really the big bottleneck. And so here,
Starting point is 00:10:02 if you string all these scripts together, then you can essentially just put in as much, as many YouTube videos as you want, as many podcasts, transcripts as you want, clean them, and then, you know, create the synthetic data. So the prompt completion pairs. And all you have to do is run on open API, on open AI. And so it's pretty cost efficient as well. So like for top GBT, I think this was 150 prompt completion pairs. Cost about a dollar to fine-tune. And then Besti AI for Chamoth, I think it was about 550 prompt completion pairs.
Starting point is 00:10:34 Now it was about $3.5. So overall, works pretty well. Another repo that I posted, this is also part of that same repo is infinite GPT. This is essentially just a Python script that chunks. So if you have a really long transcript or any long piece of data you need to translate or just pass into chatGBT and using the API, this just chunks it.
Starting point is 00:10:59 So it lets you keep passing it in over and over without passing the token limit. So that's a little bit of a show into what I've been working on and how you can quickly fine-tune a chatbot. Super interesting. One of the things that seems to be really interesting right now is you have on the one hand, the sort of arms race among the big players trying to constantly, one up each other and create throw more power at this, more compute at it. Like, Anthropic today just announced their 100K Claude model, right? But at the same time, you have just this absolute legion of folks like yourself who are like tinkering, building, experimenting, just like rapidly
Starting point is 00:11:38 iterating on what you can do with these things. It feels Cambrian explosion is a super overused term, but it really does feel like this moment where everything that can be tried is being tried almost. Yeah, I couldn't agree more. I'd say it's pretty stark both in terms of how much people are learning from each other. So I'm not sure if you've, a lot of people, if you go to GitHub, would highly recommend it just looking at the trending projects. There's a lot of people who are pushing the limits of what you can do with the limited infrastructure that we have right now. So an example of that is like baby AGI or auto GPT, which like while some of their use cases are quite buggy, it's still very impressive that people were able to build that. Just because those are
Starting point is 00:12:19 capabilities. So in this case, those are both two AI agent projects. Those are both capabilities that Open AI has not what the public had have yet, but people just figured out how to string it together in a very clever way. And so I think a lot of innovation is going to come from that. The second, which I find for me has been incredibly useful. And at times, like very funny, is using these tools to learn more and then building tools to help the tool help me more. So it's a very circular thing. So for instance, earlier this week, I realized I should be doing way more with Langchain. So Langchain is a framework for building with large language models. It's building a website in HTML versus building a website in React. It makes it much,
Starting point is 00:12:57 much easier and faster. And so I realized, okay, I'm wasting a lot of time. I should really be building a Langchain. The problem is that chat GPT, just as like a tutor for teaching me how to build, does not have access to Langchain because Langchain is a new development. And so what I had to do is trying to like piece together the tool, like the PDF chatbot tools, download some of those repos, give it the lane chain documentation. So essentially use some of other people's work, a little bit of my own work, and build like a personal chat GPT that knows Langchain, and then it can teach me a lane chain.
Starting point is 00:13:28 So I was like, okay, I have this gone like thing, but I need to give it this info. So let me build a tool to give that information, give it back to it. And now it has the information to then again teach me and help me build forward. So it's like very circular where we're learning from the technology and then we're building tools to help the technology improve so that we can improve. And just like this back and forth, I think, is going to result in a lot of exponential growth and interesting projects in the next few months. Yeah.
Starting point is 00:13:53 It feels like the ability to stretch farther. Like, one of the things you said right at the beginning of the conversation was you're effectively building things you have absolutely no business being able to build. And that allows you to fail in more spectacular ways that allow you to learn, like, way more quickly than you would. Yeah, absolutely. And I think if anyone's thinking about building an AI project or this has an interesting use case, maybe for their work or just for fun.
Starting point is 00:14:16 there's really almost no barrier to entry. Like the place that I start is just asking Chad GPT, hey, I have this idea, here's what I'm thinking, how can I build this? And then ask it some questions. And that's often a very good place for me to start. And it says, okay, for this section, you're going to need a Python script.
Starting point is 00:14:34 And for this, you're going to need a maybe like a Vue app or React app. And then you're going to need a build database. And then you can just have it walk you through each of those steps. And while it can't do it for you yet on its own, it's incredibly helpful. And so I think some mixture of using chat GPT, asking a lot of questions, being willing to just ask stupid questions
Starting point is 00:14:52 and really start from square zero. It's super helpful and then definitely helps to be on Twitter or have some developer friends because there definitely are limitations. And so I often will find bugs that chat CBT has no idea what to do. And so in that case, it's definitely good to go to the adult developers who have actually done proper training
Starting point is 00:15:10 and very grateful for their help. I do think that people underestimate how valuable chat GPT and just these tools in general are from the standpoint of brainstorming for people who are non-technical, what would your three best or biggest tips be if they aspire to call it two to three months from now be actively building interesting applications in this space? Yeah, so I would say the first is watch a YouTube video or read an article about how to use the command terminal. That's like the one in chat chad chd pd pd can help you with that. But if it used the command terminal on your computer, that basically all you have to do is download like a coding environment, so like VS code,
Starting point is 00:15:48 which is free. And then essentially you can just clone other people's repos on GitHub. So like learning how to do that. And you're not really doing any coding. You're just using other people's code. And by playing around with that, you can like tinker it. There's a lot of things you can build. You can use templates.
Starting point is 00:16:01 That's essentially all no code. So I think that would be number one. Number two, just like use chat GPT religiously. It's incredibly useful. It's like getting better at prompting. And so like the more specific you prompt, the better. So typically I'll have quite a long prompt. I'm like giving it an identity, kind of priming it a little bit for what I need.
Starting point is 00:16:19 And then I would say, let's see, on the third thing is it definitely helps to have some technical understanding. And so for me, I realize that what chat, GPT and GitHub co-pilot eliminate is the need to know like a very specific syntax. Like when I was in high school, I took one web development class and we had to write on paper how to build like a structure for a website. And so you're no longer going to have to necessarily know, like, all of the HTML tags, like all of the Python and JavaScript syntax. But what is very helpful is knowing the components in the structure. And so for that, like YouTube is great. I think there's a channel called Free Coding Camp or something along those lines.
Starting point is 00:16:58 I watch some of those videos, having ChatGBTGPT explain things to you. And generally just reading articles or documentation because of components you need, then you can have much more effective props as opposed to saying, hey, chat, GBT, do this magic thing for me. That will get you like a kind of a mediocre result versus HITBTBT. I need you to build a Python script that takes this much data, chunks it for every X tokens, and then outputs this file. So it's like slightly more specific, but it's much more high level. So you don't really have to get any digity with the specific syntax, but just knowing like the structure of that language, like how it communicates, like generally how computers work. And you can do that in afternoon,
Starting point is 00:17:36 just like watching some YouTube videos or reading. What AI trends are, are you most excited about or interested in right now? And maybe more locally, what are you excited to build towards the next few months, the next few months, whatever the interval might be? Yeah, I'm very excited about, unsurprisingly about chatbots in the process of either building personal assistance, building a second brain. I think there's just a ton of traditional business opportunities, whether it's just like traditional businesses that have a lot of proprietary information that either their employees need to be more effective. So you can do like simple Q&A chatbots on a database of proprietary info.
Starting point is 00:18:15 I've had a lot of people reach out to me who are working in trading or in finance, who are thinking about building proprietary language models to then take the massive amount of data they have and help them with their existing strategies or maybe test their thinking on existing strategies. So really exploring chatbots, like the best methods for fine tuning, like fine tuning, vector embedding, prompt engineering. What are all the different tools that you can use to get different outcomes? And so if you've been watching some of the stuff I've been building, each iteration of these chatbots that I've been watching,
Starting point is 00:18:45 I hope they go viral. So I like I'm choosing people who are fun, who are maybe controversial or get a lot of clicks. But underneath is I'm really trying to improve the infrastructure for how to very quickly and effectively build a chat bot around any use case you want. And so that's a little bit of a hint of what I'm working on. And we'll be eventually building that out as more of a software product. So people can go in and we'll give too much information.
Starting point is 00:19:09 from away, but very quickly and efficiently build any chat about you want based on any niche, any person, many different use cases. So currently playing with that. And then I think another area that I'm very excited just to explore in, but I'm curious if you've seen any good examples of this, but I think monetization of AI applications, with the exception of the base layer, so open AI and like maybe pinecone, so just like charging for APIs is very untested. And so I'm curious to see which monetization models work in this environment because it's often hard to charge people. If you're paying open AI $20 a month for essentially godlike powers, it's not very compelling to pay someone else $20 a month for a very specific use case, unless the pain point is really high.
Starting point is 00:19:53 And so I'm just curious to see which monetization models pop up, whether it's traditional SaaS, whether it's a few messages for free, and then there's a paid tier, whether it's something more creative. I'm sure that people will be testing these out in the next few months. And that's something that I'm going to be seeking a bit of time into is figuring out which modernization strategies can work for some of my projects. Super interesting. Yeah, I have a whole bunch of thoughts on that. But Emmett, really awesome to have you joined and shared your kind of learning journey.
Starting point is 00:20:20 Excited to see what you build. Where can people find you if they want to keep track? Yeah. So you can find me at Twitter. So every week I share what I'm building and what I'm learning. And that's just at E-H-H-L-M underscore. Awesome. All right, Emmett.
Starting point is 00:20:32 Great to talk to you and look forward to the next time. Awesome. Thanks, Daniel. Thank you.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.