Everyday AI Podcast – An AI and ChatGPT Podcast - EP 129: AI Image Generators - The Good, The Bad, and The Awesome

Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. There are so many AI image generators, and they are extremely capable.

Starting point is 00:00:55 You know, you've probably heard of ones like Midjourney and DALL-E, but there's, I swear, new, amazing AI image generators coming out almost every week. And maybe you're wondering which one's for me. If so, today's episode is for you. Welcome. My name is Jordan Wilson, and this is Everyday AI. It is your daily live stream podcast and free daily newsletter, helping everyday people like you and me not just make sense of what's going on in the world of AI, but how we can make sense with it or make dollars, right? Make money, grow our businesses, grow our careers. So I'm extremely excited today for our guest, who is,

Starting point is 00:01:33 I will say, bar none, I'd say one of the best out there in sharing and creating AI art. So I'm extremely excited for that. But before we get into that, let's go over what's going on in the world of AI news. So speaking of AI image generators, Midjourney, one of my personal favorites, is coming to the browser. So a big move from what I would say is the leader in the AI image-generating field. So everything was previously in Discord, which if you are new to Discord or AI images, it's essentially a separate program you had to log into. So Midjourney with this pretty big announcement saying that they're moving some AI image generating features inside of their actual website.

Starting point is 00:02:18 So you don't have to go into Discord. All right. Our second piece of news is also image related. So poisoning your artwork. Is that a thing of the future? So a new tool called Nightshade allows artists to poison their own artwork in order to disrupt and damage AI models trained on their work without their permission. So Nightshade can cause AI models to malfunction by injecting little poisoned images into their training data. This to me is extremely interesting, but it could be, in my opinion, too little too late.

Starting point is 00:02:56 So many of these big AI image generating platforms have already been trained on countless images. It might be too little too late, but I like it for artists trying to protect their future work. All right, last but not least. So some big news for the U.S. becoming an AI destination worldwide. So the White House announced 31 designated tech hubs with a focus on AI to improve American competitiveness and will provide grants for each hub. So the Biden White House today announced that these hubs will receive grants between $40 million and $75 million each. And they are kind of scattered throughout the country, and the focus of these tech hubs will include everything from quantum computing and clean energy to obviously artificial intelligence.

Starting point is 00:03:47 So I'm excited to see what's going to come out of that. There is one, hey, I'm from Chicago. It looks like there's one coming to Illinois. So I'm going to keep my eyes out on what's coming here. But you probably joined today not to hear about the AI news. Maybe you did. Thank you if you did. But you probably want to know a little bit about AI image generators.

Starting point is 00:04:07 I do. I spend a lot of time in these, but my guest today spends a lot more time than I do and a lot more time than most. So please help me welcome to the show Leonard Rodman. He is the founder of Rodman AI. Leonard, what's going on? Thank you for joining the show. Hey, such a pleasure to be here, man. I really appreciate it. Yeah, that's definitely interesting, like you mentioned, about the image poisoning. I mean, I would say the biggest takeaway from that before we jump into the show is that I don't think it's going to last forever, right? Like, as you mentioned, they've already trained those, you know, image data sets. And, like, I don't expect that this image poisoning is going to last and keep up with every technological advance, in the same way people are claiming they can detect AI. Google actually also said that they have a new project that can detect generative images with 100% accuracy or something like that. And when I hear things like that I'm like, that's never going to hold, right? That's never going to keep. Same. Same. You know,

Starting point is 00:05:11 Leonard, it's kind of like when, you know, OpenAI, you know, came out with their, you know, quote unquote, you know, text generation. And they said, oh, we can help you detect if, uh, this text was AI generated. And then they quickly found out like, no, they can't. So they shut it down. But, uh, anyways, I digress. So let's, let's actually quick, quick tell everyone a little bit about what you do, um, you know, with Rodman AI and, uh, in the AI space. Yeah, so I work for a tech company here in Chicago. We're both local. Jordan and I hung out the other day and had drinks, which was cool. In my spare time, my copious free time, I've been working on my personal website and portfolio, Rodman.ai. I initially started it to provide free diverse clip art because as an instructional designer and L&D executive, I've always found that it's really hard to find diverse clip art. So it was like a pet project of mine, banged out 10,000 free clip art, which I still get probably a hundred visitors a day coming by to grab free clip art from me, and then I went from that into ChatGPT prompts and Midjourney prompts

Starting point is 00:06:19 and from there I've been doing other types of image generators. So ChatGPT, as many of you are familiar with, is a text generator, so I'm creating prompts for different fields, different jobs, sort of trying to show how it can apply to different people doing different things. I think that's really important to show everyone how they can benefit from AI, how it's not just a them thing. It's me too, right? That I also can benefit, whoever you are. Midjourney, I've been trying to sort of notice what I'm calling styles within the image generation, meaning that the algorithm is trained on certain images, certain words, and trying to figure out which they are and what they produce.

Starting point is 00:07:03 It doesn't always work out. And the idea is to basically provide people with keywords that they can use in prompting. So I just hit guide number 100 for Midjourney. I'll take a break on that until they come out with Midjourney 6. So I'm focusing on Leonardo, which is a big, so they have a cool name, right? Just like mine. They're a big SD and SDXL, which is Stable Diffusion and Stable Diffusion XL-based image generator. There's also another new one called Musavir, which is coming from Dubai, which is very exciting.

Starting point is 00:07:38 I believe it's also SD-based, for Stable Diffusion-based. And then also looking at DALL-E, Firefly with Adobe, which has also just come out with a new update. So Firefly 2, Leonardo 2. And really just trying to compare and contrast these image generators. So to show what you can do in each of them, what you can't do in each of them, and really just trying to educate people about how they can best use them and get really cool results. And, and, you know, hey, as a reminder to everyone, Leonard's going to be dropping, I think, a ton of knowledge.

Starting point is 00:08:13 I can't wait. I'm like ready to learn along with y'all. But as a reminder, you know, if you're joining us on the podcast, check out the show notes. Come and join us live. And thank you to our live audience, May Britt saying, yes, this will be a great one. Been experimenting a lot, looking forward to everyone's insights. Alar saying big on Midjourney, you know, Doug's just saying good morning. Doug, thank you for joining us. So get your questions in now because I would have loved, just FYI, I would have loved to know Leonard like nine months ago.

Starting point is 00:08:45 Like when I was first, you know, or you know, when DALL-E 2 came out, you know, he spends so much time and creates such great guides. So get your questions in now. But maybe, maybe let's start with this. We said, hey, we're going to go over the good, the bad, and the awesome. So maybe let's start kind of in the middle. Like what would you say if we're talking about, hey, what's a good AI image generator? What's maybe a good one for people to start to learn if they're new to the space? Yeah, I would say what I've really been recommending to people the most has been, and am I frozen? No, I think you're good.

Starting point is 00:09:21 Oh, good. All right. So, you know, I think people should really be starting with DALL-E. That's my recommendation to them. DALL-E is really good at following instructions. You don't need to sign up. You can use it on the web. So if you go to Bing.com, B-I-N-G dot com,

Starting point is 00:09:38 slash create, that'll take you to the DALL-E image generator. And let's see if I can share that briefly and get it to work. I got it pulled up right here. Bam. Perfect. All right.

Starting point is 00:09:51 So let's try generating something. I think what's most exciting about DALL-E is that it follows instruction. So if you tell it, like, I want a dog standing on a truck, holding a sign that says, happy birthday.

Starting point is 00:11:04 Here we go. And if you're listening on the podcast, I'm literally typing in, and this is how easy it is, I'm typing in what Leonard is saying live.

Starting point is 00:11:36 So, you know, we went to Bing.com slash, what was it, slash images or slash create? Slash create. Slash create. And we typed in a dog standing on a truck holding a sign that says, happy birthday. And I'll zoom in a little bit here. And it's pretty impressive, right? Leonard? Yeah.

Starting point is 00:11:57 That's not bad, right? I mean, the dog can't really hold things because it doesn't have hands, but pretty good, right? Yeah. And maybe let's talk about this a little bit more because I love that you picked DALL-E because I love it. It is easy. But, you know, one thing that it's probably worth talking about because it's something that a lot of people run into issues with right away is the text. Right.

Starting point is 00:12:24 So maybe can you just kind of quickly explain, you know, kind of the intricacies or, you know, the pros and cons of working with text, because not all of them handle this like DALL-E. You know, if we're looking at the results here, it looks like DALL-E got the text about like 90% right in three of the four images, which is actually pretty impressive. Yeah. So people who haven't seen AI before are probably like, what's "happy Burb day," right? But this is a big step for us. If we look at how things used to come out of or still come out of things like Midjourney,

Starting point is 00:12:57 basically image generators were never really taught to read and write. So just like a kid who was never taught to read and write, if you ask them to draw someone holding a sign, they know that letters are like glyphs and they're going to draw some glyph looking things on that sign, but they don't really understand how to read and write. They don't know cursive. So that's been a really big thing that DALL-E has been bringing us, whereas other image generators haven't had that before. There's still some guess and check, right? So like, we did it. It's like, okay, not all of them came out perfect. But considering we

Starting point is 00:13:33 gave them like a pretty complex prompt, that was really good output, I think, right? A dog standing on a truck, holding a sign, a bunch of different things there. So if you gave that to Midjourney, you're just gonna end up with an image with a dog, a truck, and a sign in it. And the sign's gonna say some gobbledygook

Starting point is 00:13:51 that doesn't make sense. But I think, you know, that's what makes DALL-E exciting. I'd say that image quality is not quite as good in DALL-E. So that's pretty good. It does photos pretty good, but not as good as Midjourney. When you look at a dog, it's a little easier. If you look at a person, for example, their skin texture is more likely to be like a porcelain doll or plastic-y-looking. And I'd say it's really the defects that make people look realistic, right?

Starting point is 00:14:24 Like normal, real human beings have skin tone. They have skin imperfections. And you're used to seeing that. And when you see someone who looks perfect, you're like, oh, they look like a Greek statue. Maybe in a good way, right? But they don't look real to you. And your eye and your brain recognize that. Even if you're not perfectly trained on image recognition and photography, right, we're all used to just what do people look like.

Starting point is 00:14:50 And we have a really good idea of that. And we just, you know, know that innately. Yeah. And that's that uncanny valley that people have talked about in the past that's kind of gone away because like all the image generators are so good. They're past the uncanny valley, but the idea is like it looks like a human, but not quite. So it sets off kind of like this alien part of your brain where you're like, that's an imposter. And just, hey, just as a reminder, unfortunately, Leonard's screen is frozen. Don't worry about it. We've got his audio. We've got the screen share going. So don't worry about

Starting point is 00:15:25 that. We can still tap into all of his insights. So Leonard, this is a great example. And I love that we started with DALL-E because, aside from what Leonard said about going, you know, to Bing.com and accessing it that way, you can also, if you have the paid version of ChatGPT, access it there as well. All right. So let's maybe transition, Leonard. So the good, we said, is DALL-E 3, and it handled text really, really well. Maybe what's one, maybe not bad, but what's one that maybe kind of still needs to be improved a little bit? Okay.

Starting point is 00:16:05 Well, before DALL-E came out, everyone was really excited about Ideogram. And I haven't checked them out in a while. But Ideogram was kind of the first people to do text well. And people got really excited about it, myself included. I started including robots holding signs that say this, that, and the other for like weeks, just because I was so excited about the idea to communicate in that way. And it's still pretty good. I think that the press that they got from being first to do text really helped them.

Starting point is 00:16:37 They're still getting there. I've checked on them periodically, and let's see what we get out of this. Yeah, well, we're going to try the same. So if you're listening on the podcast, you know, Ideogram, I believe, I could be wrong, I think it was started by former Google people, I believe it was. But it has some really heavy hitters on the founding team, so it's not just, you know, one of these that's, you know, kind of started in someone's basement. So I did, uh, the exact same, uh, prompt that Leonard suggested for, uh, Bing Chat. We did the same thing, dog standing on a truck

Starting point is 00:17:14 holding a sign that says happy birthday. So the results here, not as good, but you know, again, not bad, right? Not terrible. No. So like totally usable. I would say if you wanted bad, bad, you could look at like DALL-E 2, for example, which was one of the first image generators I used like a year ago. And I was using it at work and I was like, hey, can we use this to make assets that we could

Starting point is 00:17:36 actually use with like, could we show this to a human? And the answer was like, no. Like everything that comes out of DALL-E 2 has like two heads and 20 fingers. And I can talk a little bit about that as well. Well, that's essentially the same thing as signs and text, right? That fingers are really tricky and tough. They do a lot of different things. It can be complicated.

Starting point is 00:18:01 Artists have trouble with them. And basically you just need training to do it well. So, you know, just like a human artist would need training to do fingers well, an AI needs training. And until it got that training, we had a lot of AIs putting stuff out with like 20 fingers, which, that's like, obviously wrong. Now we're down to the point where it's just like the skin texture is imperfect and occasionally you might get like one extra finger. Sometimes you still get an extra hand in Midjourney or in DALL-E, which surprises me. But you know, that problem is mostly in the past and now we're just down to like really refining it. So you know, we're so close to the point where you can't even tell the difference between a photograph and something generated by AI, and that's crazy.

Starting point is 00:18:45 Yeah, and Leonard, I think you bring up such a good point because, you know, when these kind of AI image generators first debuted, you know, specifically if we're talking about DALL-E 2, that actually predated ChatGPT, which a lot of people don't realize. So some of these early AI image generators, people shared them and they went viral, maybe for not the right reasons, maybe for, hey, this person has three hands or this person has nine fingers, but, you know, the quality now is fantastic. It is, you know, even myself, I've taken more than 500,000 photos before in my life, right?

Starting point is 00:19:25 I was a photographer-ish in my former life. And now, can you even tell the difference? Like, or do you really have to, like, stare at an image for a long time to see if it's real or if it's AI generated? Well, each AI has certain specific things that it does wrong that you can kind of learn to recognize. So I would say that in like a split second, I can tell still if something's AI generated or not. But I'm also in the same boat where I've done photography for like 30 years. You used to take fewer photos back when it was on film. Once it switched to being on like a two gig or bigger, two terabyte flash card, you can just take like, you know, you look at like the number of photos you can take and it literally runs off the edge of the screen on your digital camera just because it's like you can take like a billion billion photos.

Starting point is 00:20:15 But, you know, you sort of learn to recognize stuff like that. I think everyone can still see it. I think someone who's less expert might be fooled initially for longer. But I think that really anyone could look at one of these AI-generated images and recognize that it's not real. And not even necessarily just because of what I would call miscalculations. So, for example, in the image we have on screen, the dog is holding a sign that says happy birthday. He's holding it with like a stick that kind of like passes through his chin instead of

Starting point is 00:20:47 his mouth, and the truck he's standing on only has one side, the right side, not the left side. Um, so you can see things like that, but you would also recognize it in like, um, you know, the glint in an eye, for example, or specular things involving lighting can be really a big clue. Um, you know, I think really what's more important, though, is just getting all these different image generators so close to reality that they're really usable. So for me, what's most exciting recently is that Leonardo came out with a new model based on SDXL, Stable Diffusion XL. And I'd say this is really the first true competitor I've seen to Midjourney, where I've pulled up images from Leonardo and Midjourney, and occasionally Leonardo actually does it better, whereas in the past, Midjourney just won every single battle, every single heads up.

Starting point is 00:21:44 So that's really exciting, just seeing like real competitors in the space. Yeah, absolutely. And, you know, I feel the same way. It seems like, you know, for many months, at least in my opinion, Midjourney was kind of running away with it. And then we got DALL-E 3. And yeah, some of these new models are fantastic. So actually here, before we get into the awesome,

Starting point is 00:22:06 Before we get to kind of the final phase, there's a couple great questions here, and please continue to get your questions in. So I'm going to let you handle this one here, Leonard. So Monica asking, Leonard, are there any restrictions using these images for commercial purposes? Great question. Probably should have got to that sooner maybe. But yeah, Leonard, what is the kind of the general school of thought? Because there is technically no quote unquote law out there.

Starting point is 00:22:33 But what is the school of thought or best practices for using AI images for commercial purposes? Yeah, so I'm using it currently at work for commercial purposes. I know lots of other people who are and nobody's gotten sued yet. I'm sure someone will get sued eventually. I imagine it's going to be some big firm because generally you want to sue people who have money and not people who don't have money. And any lawyer will tell you that.

Starting point is 00:23:00 Let's find someone with money to sue. Not someone who doesn't have it. I think really the biggest place you might get yourself into trouble, potentially is if you were like going out of your way to try to rip off someone else's work. Or if you like released a soda and it looked exactly like Coca-Cola, but it was like Koki-Cola or something like that. Then they might come after you and be pissed. Other than that, I really think that it's pretty much fair game and safe to use generated images for commercial purposes. I would just advise people to like count your fingers and toes and check your quality,

Starting point is 00:23:32 but I wouldn't be super concerned about the legality. Of course, be, you know, aware that I'm not a lawyer. But I have talked to a bunch of lawyers about this and so has Jordan. And I don't think anyone's really going after individual creators yet. You know, it's really just things that are obviously illegal that I would shy away from. Yeah. It is important to talk about, right? Because that was the first hesitation. Well, I believe the first wave was like, okay, this isn't good enough to use for commercial purposes, but then, you know, probably once Midjourney got to, you know, version, you know, five point whatever and now DALL-E 3, and now, you know, we're talking about Leonardo with some

Starting point is 00:24:13 new models. Now it's like, okay, yeah, now they're definitely good enough for commercial purposes. So it is, yeah, but there have been, uh, plenty of lawsuits already, but mainly geared, um, you know, at the makers themselves. So, you know, we talked about Stable Diffusion, which is a model. They've taken on a bunch of lawsuits. But yeah, as far as I know, there haven't been any individuals kind of targeted because, yeah, it's a gray area right now, which does make it a little tricky. But I will say that it is very widespread and commonplace to be using

Starting point is 00:24:51 AI-generated images for commercial purposes. Here, we'll do one more before we get to the awesome, Leonard. So May Britt asking, what is your favorite generator to generate high quality pictures for ads or website content? And maybe this is the transition to the awesome, right? Yeah, I would say definitely go with Midjourney for the best quality. Like I mentioned, Leonardo can be a really close second sometimes. So they both cost about the same, 30 bucks or 40 bucks a month for their sort of medium level that most creators can make good use of. So I would say, yeah, try out Midjourney, give Leonardo a shot.

Starting point is 00:25:34 I would probably say you'll end up with Midjourney, most likely, but see what sort of suits your taste. I would say Leonardo gives you more shots that, like, a real person would have taken. And Midjourney gives you more dramatic and dynamic professional photographer type ones. But, yeah. All right, so let's look at our dog standing on a truck here that Jordan's rendering for us. We kind of got lucky. So Midjourney doesn't really understand things like dogs standing on a truck, but usually

Starting point is 00:26:05 if you have a picture of a dog and a truck, the dog's going to be standing on the truck or sitting in the truck bed or something like that. As you can see, the text did not come out, and it gave us some random letters. But those are actually an improvement. So clearly they've been working on this and must be working on this for the new one. You used to get like a bunch of letters that didn't even look like English alphabet letters. And now at least all the letters look like English alphabet letters, which is interesting. So Midjourney and these other generators will like sneak little mini updates in.

Starting point is 00:26:34 They actually work on the live version of their product instead of releasing it. So like one day, all of a sudden you get like a new feature that you don't even know about, which might be something they announce or might be something like, you know, better looking letters as they work on research about letters. Yeah. And you know, if you're listening on the podcast, yeah, Leonard was kind of giving us the rundown, but we put the exact same prompt that we've been using in some of these different image generators into Midjourney.

Starting point is 00:27:01 And yeah, Midjourney right now obviously struggles with text. But in terms of, you know, photo quality, I don't, you know, now that I'm looking at it, I would say there's maybe fewer errors, aside from the text, in Midjourney. But overall, if, I don't know, if I had to use one today, because of the text, I might use, in this very specific use case, the one that we generated from Bing using DALL-E 3, just because it came out with a nice sign, but there were some errors. And that's sort of where I stand, too, where if I really need it to follow my instructions precisely, or especially if I need letters, I would say go with DALL-E. And it's getting closer and closer.

Starting point is 00:27:52 So that's exciting for everyone. I'm sure it makes Midjourney nervous, but I'm sure they have something up their sleeve. Oh, yeah, absolutely. So, all right, here, we have another question, and I'm also going to throw on some tips. But Leonard, so Dr. Harvey Castor asking, what are some top suggestions for prompts, tips, etc? And I'm going to go ahead, if you're listening on the podcast, Leonard said, so I kid you not, he has a hundred guides already. But Leonard, maybe as I even scroll through something, what are some of those suggestions for prompts and tips, maybe just for DALL-E, but we can also talk in general. Yeah.

Starting point is 00:28:33 So in general, once Jordan gets to the fourth slide here, yeah. I put on all my prompt guides this sort of short and sweet summary of some ways to what I call dress your styles or your prompts, so to dress it up a little bit with descriptive words. And so here I've got kind of five hot ones that I recommend to people. So mood, and that isn't just the mood of the characters, but it's the mood of the whole scene. Right. So do you want a happy and joyous photograph? Is it, you know, what kind of mood is it supposed to evoke in the person who's viewing it?

Starting point is 00:29:13 So you can throw some words about mood in. I think that lighting can be helpful. So throwing in the type of lighting you want. And it doesn't have to be a fancy word like chiaroscuro or whatever. It can just be bright, dim, right? So just trying to communicate what you want. What kind of medium you want. So do you want a photograph? If you do, tell it, right, every single time. And then you're going to get a lot more photographs. Whereas you saw that we got a mixture of photographs and illustrations and other things. Throwing a camera in there can be helpful. And people go nuts with this one with like every little

Starting point is 00:29:49 detail about the camera. And that's not really how the image generator works. But if you do put a professional camera name in there, you're more likely to get a professional looking photo because people uploaded a bunch of photos tagged Canon EOS 5D. And if you own a $10,000 camera, you're usually pretty good at photography. So that gives you like a little bit of a gate around like what kind of images are going to go into your training set. You can also do things like iPhone if you want a less professional photo. GoPro, if you want something action oriented, Polaroid or other old school cameras,

Starting point is 00:30:25 if you want some more old school looking. Composition can also be helpful. So if you want something more action oriented, you might want skewed or off-centered. If you want everything centered, you might want to put that in. If you want the character on the left or the right, you might want to put that in.

Starting point is 00:30:42 I frequently find that if I'm not using text, you can flip an image left or right, but if you're using text, then you're limited. And it's definitely helpful just in terms of graphic design. If you have a character, they should be looking at the rest of your page. It's like if I were to put someone in the bottom right of this page here, they should be looking up and to the left at the text because that directs the viewer's eyes. Midjourney has a bunch of parameters that don't necessarily apply to other things.

Starting point is 00:31:11 So aspect ratio, that's coming to other image generators, but not there yet, I believe, for DALL-E. Niji, that's super specific to Midjourney, but it's its own specific anime style. Style raw, where you could also use things like style expressive for some, excuse me, different subsets within Midjourney, and they also have weird. But really, the first things up top, mood, lighting, medium, those are the things that are going to work in any image generator, and they're really going to improve your prompts just by communicating to the computer what you want.
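
The keyword categories Leonard lists (mood, lighting, medium, camera, composition) can be sketched as a small prompt builder. This is a minimal illustration in Python, not a tool mentioned in the episode: the function names `build_prompt` and `add_midjourney_params` are hypothetical, the phrasing of each keyword is just one possible choice, and the `--ar` and `--style` suffixes assume Midjourney's double-dash parameter syntax.

```python
def build_prompt(subject, mood=None, lighting=None, medium=None,
                 camera=None, composition=None):
    """Join a subject with optional descriptive keywords (hypothetical helper)."""
    # Stating the medium up front ("realistic photograph of ...") follows the
    # advice to tell the generator what medium you want every single time.
    head = f"{medium} of {subject}" if medium else subject
    extras = []
    if mood:
        extras.append(f"{mood} mood")
    if lighting:
        extras.append(f"{lighting} lighting")
    if camera:
        extras.append(f"shot on {camera}")
    if composition:
        extras.append(f"{composition} composition")
    return ", ".join([head] + extras)


def add_midjourney_params(prompt, aspect_ratio=None, style=None):
    """Append Midjourney-only parameters; other generators would ignore these."""
    suffix = []
    if aspect_ratio:
        suffix.append(f"--ar {aspect_ratio}")
    if style:
        suffix.append(f"--style {style}")
    return " ".join([prompt] + suffix)


base = build_prompt(
    "a dog standing on a truck",
    mood="joyful",
    lighting="bright",
    medium="realistic photograph",
)
print(base)
print(add_midjourney_params(base, aspect_ratio="16:9", style="raw"))
```

The split into two functions mirrors the point made in the conversation: the general keywords should carry over to any image generator, while the suffix parameters only make sense in Midjourney.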

Starting point is 00:31:47 And then the other thing is just to iterate, to try over and over. So if you don't get it right the first time, you try it again with some new keywords. You can even use a thesaurus, so you might even use ChatGPT. I wouldn't use it to write your prompts, but you could use ChatGPT to give you suggestions for a word to throw in, or a word swap. So for example, for a long time I was trying to get clear plastic tubing. And I couldn't get it. So I eventually learned that you have to try synonyms.

Starting point is 00:32:16 So I tried like glass, ice, crystal, leaded crystal, translucent, right? And you just try all these alternative keywords. And that's how you eventually get at what you're looking for. So you might not describe it exactly how it is. You want to find a way to describe it that communicates your need to the computer. Yeah. It's so good there. Like, y'all, like we'll share a lot of this.
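[Editor's sketch] The synonym-swapping approach described above amounts to holding the prompt fixed and changing one keyword per attempt. A minimal illustration, assuming a hypothetical template and helper name (`prompt_variants`); the synonym list is the one mentioned in the episode, and each resulting prompt would still be pasted into the image generator by hand.

```python
from typing import List


def prompt_variants(template: str, synonyms: List[str]) -> List[str]:
    """Return one prompt per synonym, leaving the rest of the wording unchanged."""
    return [template.format(material=word) for word in synonyms]


# Hypothetical template; only the {material} slot changes between attempts.
template = "realistic photograph of {material} tubing on a workbench"
synonyms = ["clear plastic", "glass", "ice", "crystal", "leaded crystal", "translucent"]

variants = prompt_variants(template, synonyms)
for variant in variants:
    print(variant)
```

Changing a single word at a time makes it easier to tell which keyword actually moved the output toward what you wanted.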

Starting point is 00:32:43 And, you know, Leonard, if you're not already following him on LinkedIn, I suggest you do so, especially if you're interested in AI image generators, because he has now a hundred of these guides, and they are so, so good. So, all right, we have a lot of questions here, Leonard. I don't want to keep you forever, but let's go, let's go just quick, rapid fire and see if we can't get a couple questions answered here. So Ben asking any tips for how to get images of people or animals to look less CGI-like? So I'd say number one realistic photograph, throw that keyword in there.

Starting point is 00:33:21 If you're using something other than, honestly, Midjourney or the latest Leonardo, you're probably going to get things that look kind of CGI-like. That's that uncanny valley and skin texture I was talking about before. So really, Moosevere is also one that's pretty good at this point, but they're in beta, so that doesn't help you. So really just you've got to go with Midjourney or Leonardo and pay for it, I'd say. Use the keyword, you know, realistic photograph. You can use things like cinematographic or people use 8K, which doesn't really make it 8K, but they think it does.

Starting point is 00:33:55 I don't know, people like those outputs. You can also, like I mentioned, mention a fancy, expensive camera, and I think that can also be helpful. Great question. Great tips. Great tips. All right, here we got another one from Brian. Brian, thank you for the question. So he said, what about copyrighting the images that you actually generate? Is that a thing? Does it work? So people slap their logos on, and that does nothing except for make it harder for someone to steal your work, which that's fine.

Starting point is 00:34:23 I don't object to that. If you really want to copyright one of these images, you have to put in 51% of the work, which no court has really determined what that means yet. So if you took like two different works, then you photoshopped the heck out of it for an hour or two hours and videoed yourself photoshopping it and then sent that in, you could probably copyright it. But honestly, I would probably just wait a little bit longer. I think that more likely, you know, not even more likely, definitely, you know, you own the copyright on the design you put on top of an image in Photoshop or whatever, but right now it's probably not looking good if

Starting point is 00:35:04 you want to copyright something that's a direct output until you put all that extra work in and even that it's not really settled. Yeah, yeah, Brian, this is one that's continuing to go through the courts. So far, I don't think anyone has successfully been able to copyright something they produced in terms of just strictly AI image generator. But like what Leonard said, it is kind of ongoing. All right, we got another one here. Mike, is there a way to upload an image and have it modified?

Starting point is 00:35:31 Yeah, and the next person also, Anibol, also has something pretty similar. Can you correct an image? So can you upload an image and modify it? You can, depending on what you mean by modified. So in Midjourney, you can upload an image and use it to inspire your next work. But I found that it's really just turning that image into text and the text back into an image. And maybe following some of your character placement a little bit. If you really want to modify it, I would say bring it into Leonardo, which will let you do.

Starting point is 00:36:05 It'll let you upload an image and then directly modify like a section of it, for example, which is something you can do in Midjourney with an image you created in Midjourney, but you can't do with an upload. Whereas in Leonardo, you can modify a piece of an image, like Anibol asked, for detail editing or to take fingers out or add fingers. So you can do that in Leonardo and Midjourney with images you've made with them, but only in Leonardo can you do it with an upload. Yeah, great, great question. All right. And we got to our last one here. So Monica, since you got the two for one there answering Anibol's question. So Monica asking, do you have any data or case studies on performance for ads using AI

Starting point is 00:36:48 generated images versus real photographs? I don't have any data. I've definitely seen, though, and you probably have too. I believe Jordan talked about the study where they found that venture capital pitches that were generated by AI got like 50% more funding or something like that. And it's really just about ticking the boxes, right? Like those venture capital pitches probably did like a more standard thing and didn't leave anything out, whereas a lot of people applying for money probably are not great at writing pitches. When you're generating images with real photographs versus or generating images versus real photographs, you know, I think that something I've had trouble with as a web designer and doing advertising design and doing learning

Starting point is 00:37:34 design is you can never find the picture that's exactly right for your purpose unless you literally go out and take it yourself. So AI gives you the ability to have an image that suits your storytelling. And not only that, we didn't really talk about this, but I think it's really going to change storytelling, because in the past you would write your copy in just words and then try to find images later. And now I think people are going to be coming up with images as part of their flow because they know they can make the perfect image. And it's going to become part of storytelling in a way that it wasn't before. So no case studies, but I think that AI generated images for advertising are going to

Starting point is 00:38:12 blow everyone away. That. What like, I wish I had something to like pound a bunch of emojis on what Leonard just said. But I think what he just said there is a gem of where we're going in marketing and advertising. And I know this even from personal experience. A lot of times you would really have to massage the copy or other parts of a campaign to fit the image that you had because maybe you had very limited imagery, but now with AI image generators, it's vice versa. And that's very exciting for the future of marketing and advertising.

Starting point is 00:38:43 So we've, Leonard, we've kept you for very long, but I want to give you the chance. So if someone now is very interested, so we're going to share all of your work, all your website, but maybe give everyone just that one last piece of advice if they want to get into, you know, being better at creating AI generated images. if they want to go from zero to five or from five to ten, what is your best piece of advice for people to really up their game now that we've seen kind of the good, the bad, and the awesome? Yeah, I mean, depending on how you want to look at it,

Starting point is 00:39:16 you can call it homework, you can call it play, but really just get in there for 10, 15 minutes a day, pick an image generator. It could be the same thing for ChatGPT, just write prompts for 10 to 15 minutes a day. Make sure you're iterating. So you take your output and you say, how can I make that better?

Starting point is 00:39:32 and you try it again and again. But really just getting in there and playing. I think another great way is taking what you get out of your play and posting it on the internet and seeing what people say about it. People are generally surprisingly helpful, actually, about this particular topic area. Everything else on the internet, they're pretty mean. But people about AI seem to be pretty nice,

Starting point is 00:39:51 and they're trying to help people. So, true. And hey, don't worry if you didn't catch the whole thing, if you weren't able to take notes fast enough. We're going to recap everything that Leonard and I talked about. So go to YourEverydayAI.com. Sign up for that free daily newsletter. Leonard, thank you so much for joining us.

Starting point is 00:40:10 I'm kind of bummed your video froze, but your insights were on point. Thank you so much for coming on the show. Yeah, it was my pleasure. And thanks so much for having me. Everyone hope to keep in touch. Feel free to shoot me a message anytime with any questions. Absolutely.

Starting point is 00:40:25 Go check today's newsletter. We're going to have a ton of more information, you know, from Leonard's website, more ways that you can connect and engage with him because it's a great way to grow is to connect with him as well. So we hope you enjoyed this and we hope to see you back. Actually, I got to quickly plug. We're going to be building tomorrow and Thursday. We're going to be building a brand live from scratch with AI throughout multiple parts. People have always asked for something, a show like this. So I'm excited. If you want to know how to actually use AI from ideation to publication, join us tomorrow and the day.

Starting point is 00:41:00 and every other day at Everyday AI. Thanks, y'all. Thanks, everyone.

Starting point is 00:41:24 And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit YourEverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.
