Everyday AI Podcast – An AI and ChatGPT Podcast - EP 218: Winning the Probability Game in AI Visuals

Episode Date: February 29, 2024

If you want better AI images and videos, sometimes you have to roll the dice and play the probability game. But you don't have to do it blindly. Tianyu Xu, Founder of TYAI, joins us to share secr...ets on how you can get better AI image and video results.Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode pageJoin the discussion: Ask Jordan and Tianyu questions on AI visualsRelated Episodes:Ep 198: Midjourney V6 – What’s new and producing powerful ad creativesEp 157: Future of AI Video – Pika Labs 1.0, Runway updates, Meta Emu, and moreEp 181: New York Times vs. OpenAI – The huge AI implications no one is talking aboutUpcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTimestamps:01:30 Daily AI news04:15 About Tianyu and TYAI09:44 Tianyu discusses starting simple with natural language.14:24 Text-to-video models can generate short films.18:23 Experiment with camera settings to optimize motion.20:15 Using AI video generators to speed up processes.24:43 Experiment results favor tilt up movement for video.27:29 Creative rights ownership.35:22 Emphasizing creativity and accessibility in utilizing technology.Topic Covered in This Episode:1. Unpredictability of generative AI visuals2. Process of generating AI images and videos.3. Organizing media outputs and importance of creative rights4. Limitations and challenges in AI video and imagesKeywords:Generative AI, Image/Video Categories, Successful Images, Marketing Videos, AI Technology, Creativity in Businesses, Gradual Process, Pica Labs, Video Generation, Parameters, Video Quality, Consistency in Movement, Everyday AI, Open Source Language Model, Offensive AI results, Gemini AI, OpenAI, Copyright infringement, Market Research, Large Language Models, Text-to-Video Methodology, OpenAI Sora Model, Visual Projects, Prompts Reuse, Creative Rights, Embedding Text in Images, AI Limitations, CGI Animation, Long-Term Projects, ExperimentationSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. If you want better AI images and better AI videos,
Starting point is 00:00:50 sometimes you got to roll the dice and play the probability game. But don't worry, you don't have to do it blindly. We have an expert today that's going to share his secrets and hopefully allow us all to get better AI images and video. So what's going on, y'all? My name's Jordan Wilson. and I'm the host of Everyday AI. So we're a daily live stream podcast and free daily newsletter
Starting point is 00:01:17 helping everyday people learn and leverage generative AI. So if you are joining us on the podcast, we appreciate that. Make sure to check your show notes. Today is going to be one of those shows that's a little more visual. So you might want to check out the show notes and watch the video. I think we're all going to learn a lot. If you're joining us on the live stream, thank you. We appreciate that.
Starting point is 00:01:38 I'm David joining us and Tara joining us from Nashville. Thank you. So before we get in and we're going to walk everyone through it, I think today's going to be a very fun, visual and interactive episode. But before we get there, let's start as we do with the AI news. And as a reminder, you can always go to your everyday AI.com and sign up for that free daily newsletter. Unlike the other guys, we're actually written by humans. I'm a human. I write the newsletter.
Starting point is 00:02:03 So I'm a human now and I'm going to read the news. All right. So here's what you need to know for AI news. So meta's Lama 3 is reportedly in the works. So meta is set to release an open source language model, Lama 3. And Lama 3 is designed to be more responsive to users and provide context for difficult topics rather than blocking queries. It'll also have the ability to differentiate between words and sensitive or harmless meanings depending on context. And then Lama 3 is estimated to have more than double the parameters of its predecessor, Lama 2.
Starting point is 00:02:38 All right, our next piece of AI news. Speaking of AI images, so Google CEO Sundar Pachai has publicly apologized to employees for the Gemini AI image debacle. So Google CEO Sundar Pachai has issued the apology for the release of the company's artificial intelligence tool, Gemini, after facing backlash for its bias and sometimes offensive results. So the tool was meant to create diverse images with the built-in Imagine 2 image model, but instead produce images of, as an example, America's founding fathers as black, the Pope as a woman, and other historically inaccurate and sometimes biased images. Google has also announced that it's planning to relaunch the AI generator in a few weeks. All right, our last piece of AI news, at least for the podcast here,
Starting point is 00:03:30 did the New York Times hack Open AI? Well, that's what Open AI is alleging. So in a new filing, OpenAI has accused the New York Times of quote unquote hacking their products by using deceptive prompts to generate copies of the New York Times articles in violation of their terms of service. The New York Times had filed a lawsuit against OpenAI for copyright infringement, show showcasing how Open AIs GPT models can produce verbatim copies of their content. But Open AI did claim the hacking was a rare error in their systems learning process, which could be addressed. The New York Times, obviously, has been arguing that their actions were merely a search for evidence of copyrighted content within the AI models. And there is an ongoing debate between Open AI and the New York Times regarding the use of manipulated prompts and using copyrighted copy. So super interesting.
Starting point is 00:04:26 If you're interested in that, I actually had a full, like one hour, very deep dive. I don't know anyone else that went in as deep as I did. So I'll make sure to leave that in the show notes so you can go take a look. But there's always more. So make sure to go to your everyday AI.com and sign up for that free daily newsletter. All right. I'm excited. We're not going to talk about AI news all day.
Starting point is 00:04:49 We're going to talk now about how you can improve your AI visuals, right? So if you've ever used an AI image tool like MidJourney or an AI video tool like runway or PICA Labs, you know, maybe you've got great results. Maybe you didn't. But our guest today is going to help us hopefully get much better results. All right. So let's go ahead and bring it to the show. There we go.
Starting point is 00:05:12 We got him, got him live here. So TNU, a visual artist and founder of TYAI, TNU, thank you for joining the show. Hi, Jordan. Hi, everyone. Thank you very much. All right. This is Kenya from basically Singapore. Yeah, yeah.
Starting point is 00:05:25 Well, it's, yeah, very good morning for me. It's a good evening in Singapore there. So thanks for joining us halfway across the globe. but maybe tell us a little bit about what you do as a visual artist and founder of TYAI. Yeah, actually my background is quite far away from an artist. I won't call myself an artist even now. So I have a background in market research, social media analytics, and advertising sales. So I ventured into generative AI almost at this time last year.
Starting point is 00:05:56 It's very addictive, it's very addictive to me personally, and especially when you create the visuals and videos, and chatting to chat GPT. Last year, I consulted in market research, data analytics, and content production, of course, with the help of AI. And only until the end of last year, I discovered that education should be my focus, because eventually, generative AI will reach everyone and it's an essential skill for anyone to be successful in the future workplace. That's so important because I think, TNU,
Starting point is 00:06:33 there's a misconception, right, because a lot of people think, oh, generative AI is not for me. But I don't think, you know, my personal take is you don't really have an option, right? So whether it's in, you know, two months or two years, I think the average person, even if you're not a creator or marketer or, you know, a visual artist, you're going to be using, you know, some sort of generative AI tool. Is that kind of like your thought on it? Is that kind of where you see things going? Yes, that's my view as well.
Starting point is 00:07:07 And I know it can sound quite, it can sound very, you know, it can sound very complex, especially when you hear the term prompt engineering. There's prompt and there's engineering. It doesn't sound like something that everyone will do. But actually, it's just communication. It's just a way to communicate with AI models. You're used to community with people with humans, and now we are communicating with AI models.
Starting point is 00:07:31 in that way, I think eventually everyone actually has a potential to be a good prompt engineer and everyone should be ready to learn these skills. Absolutely. So I'm curious, how did you transition from market research to now you're putting out, you know, very fascinating and great visual content? And we'll be sharing that in the newsletter as well so you can go look at a lot of his work. But how did you make that transition, you know, from market research? research to, you know, now all of a sudden you're putting out amazing and educational content on
Starting point is 00:08:06 AI visuals. Yeah, this is a great question. And I, you know, even if I'm doing the AI visuals, my methodology is still like a market researcher. So I study the statistics. I study how the models are built. And then I look for the areas where the models are particularly good at. and then I take an analytical approach to when I design the prompts and eventually optimize the prompt.
Starting point is 00:08:38 So whatever I'm doing in the past 12 months have actually have a strong connection to my past experiences. And I'm curious, how did you get to the point where you are breaking it down at such an analytical level? because I feel a lot of people with AI visuals, you know, they'll just go in there, you know, play around, try it once or twice. And, you know, then they kind of give up. And they're like, all right, well, you know, it's okay. But maybe I'll just wait for the technology to get better. So how did you really push yourself to, you know, go in and really just break this down, you know, like a market researcher. Like, how did it get to that point? It's very exciting. As a market researcher, it's like a complete new field for you to venture into. And there's no manual for that. There's no, even if you look at the official documentation from Open AI, from Mead Journey, from the other models, there's no manual for you that teaches you to talk to the model step by step.
Starting point is 00:09:46 So the only way to learn is to experiment. And I start with a very fundamental. I start with a very basic prompt and eventually build up all the variables and all the complex prompts and structures. So I just start from the very basics. I think that's the way to talk to almost every large language model or image model. That's hitting me right there, right?
Starting point is 00:10:16 Like I had a whole episode yesterday, you know, kind of recapping everything about chat chbt. And I think that's that's a perfect explanation is you should always start very simple with, with natural language and kind of see your results and go from there. So I'm curious, TNU, like how many, right? Like when we talk about generations and we'll talk through that process of what that means, but, you know, on average, you know, did you start by doing, you know, two generations and now are you up to like 10, 20, 50. I mean, in general, when you're trying to get that perfect, you know, maybe AI video out of, you know, PICO Labs or something like that, how many generations
Starting point is 00:10:55 are you sometimes doing? It depends on, it depends on model, depends on the subject. So if, I post a lot of things about cats, because it's really easy to do. And, and I just need to generate a few images of the cat of a certain style, then it is good enough for me to showcase different art styles or different things you can do with Dali or with other models. But they are more complex topics. So for example, if the model is not well trained on a subject,
Starting point is 00:11:36 for example, Mid Journey, before V6, Meadjury was unable to create mermaids or scorpion. So maybe you need to try 100 times to get a scorpion. Yeah, in that case, you will need more iterations. For the video model, it also depends on the subject. Some of the subjects are quite easy. For example, Runway and the Pika are both very good at the natural things. For example, the water, moving water, waterfall, anything with water, you just need to create one or two videos. And then they will be almost perfect. But if you want to create something different,
Starting point is 00:12:18 if you have an image, for example, I have a cat, I always use the cat. So I have a cat kayaking on the white water. That could be difficult. That could take more, more tries. Maybe it will take 20, 30 tries to get one video right. You know, it's funny because, you know, if you're listening on the podcast,
Starting point is 00:12:41 I'm kind of laughing. Same thing for me because like even the original like AI image generators, like I did like cats too. Like I was trying to, you know, hey, shout out my cat Rocky, right? I was trying to generate pictures of him. Yeah, some about cats is just like, you know, fun. So a question here from Monica. So thanks for this question. So asking, can you explain a little bit more in depth on what generations are? Yeah, we should probably explain some of this terminology. But yeah, what's a generation and, you know, when you're talking about doing them over and over, Does that just mean that you're running the same prompt over and over, or are you tweaking things each time? So talk a little bit more about what it means to go through another generation.
Starting point is 00:13:21 Yeah, sure. This is a great question. So in the, in generative AI, a generation simply means a generation of an image or generation of a video. We tend to use, we don't tend to use create a video because you don't actually create the video. you generate the video, generate the image with the model. Yeah. And then, sorry, what's your question again? Oh, no, that was perfect.
Starting point is 00:13:51 Just explaining those because I think a lot of people maybe on our show are, you know, using large language models a lot more. Some people might be using, you know, mid-journey or something like that. So I think it's important that we just kind of talk about, you know, what these models even are and what they do, right? Because, you know, and maybe it's just good to explain to the. audience. So you have your, you know, your AI image generators where you can put in either a text or a photo and get a photo on the back end. So, you know, photos like mid-jorney and Dolly. And,
Starting point is 00:14:19 you know, kind of what we're talking about here a little bit more is, you know, these AI video, you know, companies like, like runway and in PICA Labs where you can either put in text and get video or put in photos and get video. So I'm curious, Tieni, like, what's been your kind of best process? Like, do you use photos to get video? Do you use text to get videos? Do you sometimes test to do both and see which one's better? What's kind of been your best approach for generating videos in terms of what you start with? I first tried.
Starting point is 00:14:55 So there are only two ways to generate a video. One is text video, which is very similar to how you interact with chat GPT or mid-Journey. So basically you put a description, the prompt, and then the video, like typically is about three-sept. or four second video will come out based on the prompt. That works on stable diffusion video, PICA, or Rangway. So these are the leading models before SORA. So when SORA comes out, everything changed, everything changed, right? So we are still talking, so I'm still talking about some common practices I do when,
Starting point is 00:15:33 for in the models like runway or PICA. So I discovered that text to video. that text to video is not that robust. It's okay if you just create a shot video or a few shot videos, and you can combine them and make a very extremely short film out of the text to video models. But the consistency is always a challenge. When I say consistency, I mean the consistency of the characters.
Starting point is 00:16:04 For example, if you have a video of, you can use a text prompt to create a cat, kayaking on white water and then you need to create another it's only four seconds so each generation you only have a four second clip then you then to make a longer video you need to generate another one and then you're your prompt you're putting the prompt a white cat uh kayaking on the on the on white water in front of the waterfall and then you might get a completely different white cat from from the from the first video so in that way it's almost impossible to make anything That's why I do not use the text to video methodology.
Starting point is 00:16:45 What I always do is to create all the, it's like building a storyboard. So I create all the images with Dali or with other models with somewhat consistent character first. And then I upload the images to pick up or to runway and animate every single image, making them a four second or eight second video and then combine them so that they look more like a consistent mini film yeah no it's yeah that's that's a good point it's something to you know point out there to you like yeah like you know we talked about opening i sora uh that you can you know put in a text prompt and get up to a minute a video right and then you don't even have to do these multiple generations because it generates multiple scenes together but you know the majority of of people out there
Starting point is 00:17:38 do not have access to that yet. So, you know, we are all having to go through a similar process that you're laying out here. So maybe what we can do is kind of walk our live stream audience through this and maybe show them a little bit of what you're talking about. And again, if you're listening on the podcast, we'll do our best to talk you through this a little bit. But you might want to check out the show notes and come watch this. So let's go ahead and TNU, maybe you can walk us through here.
Starting point is 00:18:05 You know, so we have PICA labs open, right? And we're going to be generating a four-second video. So let's, yeah, maybe just go ahead and walk us through what you're doing here. And then we can talk a little bit about your methodology. Yeah, sure. So right now I'm in P-Car. P-K is, yeah, so P-Kard, within P-Kar, you can do the text video or image-to-video. And here I have uploaded an image of a white cat.
Starting point is 00:18:35 kayaking the white water. So here we're going to make, we're going to animate this image and making it into a three second video. But we have no clue that which direction this image, this video can go. So at the beginning of the process is very similar to rolling the dice. So basically it means that you need to try every direction. You need to try every parameter.
Starting point is 00:19:05 to see which one actually work well. So PICA has a very good camera control feature. So you can, there's a virtual camera that you can, there's a virtual camera that you can control the direction of the camera so that the image will move as a video, the image will move according to the camera movement. So for example, right now I'm at pan left and pan right. Each time when I roll the dice, I try to just choose one direction, just use one variable because it's easier to optimize in the future.
Starting point is 00:19:48 So let's try pan right. And then you can also control the strength of motion. Typically, the lowest motion is zero and the highest is four, but normally, based on my experience, the higher, strength of motion never walk. So I only choose, let's choose one, strength of motion as one. And then that's it, then we can generate a video. So that is only one video.
Starting point is 00:20:17 You never know, you never know whether it's gonna work. So and then we should try the next parameter. How about pen left? So we can do the same. Everything remain the same, but the only variable is pan left. And then we create another video. So the same thing works for tilt up and tilt up. All right.
Starting point is 00:20:45 So I'll give a little bit of background here. So if you're used to maybe using something like chat GPT, you know, sometimes you have to wait, right? But with, you know, Pika Labs and a lot of the other AI video generators, so what Tienu is doing is, you know, he's able to, you know, generate maybe four, five, ten at a time. And you kind of have to wait for them in the background, right? So these different generations, kind of his rolling of the dice, they're all kind of slowly loading one by one. And then we'll be able to see, you know, which of these directions that he kind of put in there, which is going to work best for this specific photo. So one thing that, you know, I'm curious about is, you know, you said that you always start with, you know, like a one motion. Because if you go much higher, you know, it might not work very well.
Starting point is 00:21:34 and you also kind of first test, you know, different directions. I guess is that the best way for people to go? Or maybe should they, you know, as an example, oh, let's test, you know, camera right at one, camera right at two and camera right at three, right? Like, I guess how did you come and get to that this is the best way to start the process? Okay, we can, since we are doing a live demo, I can show you what, How would the different strength of motion look like? So we do pan right with a motion.
Starting point is 00:22:12 Let's do the extreme one. Let's do pan right with the strength of motion of four, which is the fastest. Let's do the fast and furious. Yeah, and see how it works. All right, great. So our original ones that you did with the different areas of motion, all with the one, it looks like they're done.
Starting point is 00:22:32 So yeah, let's walk us through and kind of show us how you, can decide what's good and what's not. Okay, so we already have pan right. The first one is pan right? It seems okay, right? It seems okay because because the kayak is moving from the left to the right. And if your camera moves towards the right, it seems like a natural movement. And yeah, and I think this one is all right.
Starting point is 00:23:01 How about pan left? Okay. So pen left, do you see the problem? If you do the pan left, the water also moves to the left. That goes the opposite direction. Yeah, that one looks very unnatural. It looks a little wild, right? Yeah, this one doesn't work.
Starting point is 00:23:19 How about this one? This one is tuit. I think this is great. This is one, this one is tilt up. And it's showing the kayak going up, going up and down. Yeah, I think this direction also works quite well. And the next one is down. Yeah, that one looks a little weird.
Starting point is 00:23:43 It looks like something ominous is about to happen to the poor cat kayaking. Yeah, the cat is talking. Maybe we can do a lip sync to the cat. Oh, yeah, I see that. Interesting. Yeah, so that's important to talk about, right? So with some of these, you know, video generators, you don't have always a ton of control over the actual movements, right?
Starting point is 00:24:06 So in that instance, the cat was actually just moving its mouth, where in the other instances, it wasn't. Right. So, you know, I guess have you found a way, you know, aside from, and we can talk about, you know, maybe runway in their, you know, multi-motion brush, right? But is there any other way to get that consistency? Or is it just, man, you just got to keep doing generations until you get it exactly right? Yeah. Sorry, before I answer your question, I want to show you this. This is a speed of fall. The strength of motion is four. And you can see that it's totally off.
Starting point is 00:24:46 Yeah. Yeah, there's a wave crashing the white cat on the kayak, a wave crashing him. Mouth is going crazy, right? Yeah, probably something you would never use there. Yeah. And the cat can turn into a complete different animal. Okay. So essentially the higher, we crank the motion there, in general, at least how the technology is now, probably the less usable something becomes.
Starting point is 00:25:12 Yeah. So based on our seven or eight experiments here, we can see that this one, tilt up seems to be the best movement, the best direction for this image. So then what I will do next is to, if I'm a perfectionist, I can continue to generate the same, the same, I can continue to generate the videos with the same setting multiple times, maybe 10 times, just to get a perfect, just to get a perfect video. Or I can add four seconds. For example, I can extend the video to make it eight seconds. or I can edit, I can change the region, change different parts of the video to make it more interesting.
Starting point is 00:26:03 So in that sense, so rolling the dice is not just to roll the dice once, but after rolling the dice, you find the right direction that you double down on that direction, then optimize your prompt, moving, and then you continue to roll the dice. So that's, it's a chain of the activities. And I love this. And, you know, Juan, thanks for this comment saying the same thing. Love the hands on and visual with these real life examples. A couple of questions here, you know, from Tara, a great question here.
Starting point is 00:26:37 And let's just go ahead and we'll go back to here. There we go. So Tara asking, could you please share your strategies for organizing your media outputs? Specifically, I'm interested in how you utilize tagging or memory aids to streamline your process for future projects. That's a great question because, yeah, if you're doing, you know, dozens of generations maybe for the same click, how do you keep that organized? That's a great question. I'm terrible at doing this. I organize them based on projects. So I have different folders for different visual projects. And I also collect all the important prompts for all the custom
Starting point is 00:27:22 instructions for chat GPT. I collect all of them so that I can reuse them in the future. Another good question here from Cecilia. So, and you know, I'm not even sure of this one. So maybe maybe I'll be learning something. So Cecilia asking, can you ask, you know, these, these different AI videos for a storyline after you create the video or do you retain your creative rights? Yeah. So like after you get that short one, do you, you know, can you go in there and then use a text prompt, or do you just kind of keep doing what you did and, you know, keep adding on four seconds?
Starting point is 00:28:02 Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the All In One Creative AI Studio. Powered by Adobe's Creative Agent, Firefly AI Assistant lets you start with your vision,
Starting point is 00:28:21 just describe what you want, and shape the outcome as it takes, form with the assistant. The assistant orchestrates multi-step workflows drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks, like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any time. You stay in the
Starting point is 00:29:00 driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adobie.com. So I think there are two parts of the question. The first one is do you, if you want to, if you want to continue the story and you can keep, you can continue at on four seconds for on, but you can only add on a few. But you can only add on a few. but you can only add on a few times, a few instances, maybe add up to 16 seconds. So eventually you still need to create multiple clips from different images or different text prompts to make them into a full film. In terms of the creative rights, it depends on the platforms.
Starting point is 00:29:52 So for example, for me, Jenny, there are certain tiers that you own full rights of the images. For Dali, I think Dali has a, for Dali as well as, so the users who create Dali images own the rights of the images. And then for the other image creation tools or video tools, it all depends on their terms of use. Yeah, and that's important to read those terms because they're always changing. And sometimes, and I've talked about this on the show before, sometimes they're a bit confusing or perplexing, right?
Starting point is 00:30:27 you know, passing off, you know, all, sometimes passing off all liability to the end user. So that's great, great, you know, great piece of advice there, you know, to always read those terms. All right. So another great question here from Yogesh, former guest on the show. How are you doing Yogesh? So asking, have you found a way to create images with text embedded? Yes. What's the best way to do this, TNU, or is there not a great way?
Starting point is 00:30:53 Yeah, this is a great question for the rolling a dice approach. because none of the image models are very good at embedding the text into the image. None of them, because they are not trained to do this. But if you are lucky on Dali, on Me Journey, even on Google's Gemini, you can embed, you can generate images with text, with simple text, simple, like less than five words, then if most, if you're, if you're If you generate less than five words, you can get everything right within three to four tries. So if you try multiple times, like if you try 10 times, for sure you will get one satisfactory image with the text, less than five words with the text properly displayed.
Starting point is 00:31:46 Yeah. Yeah. It's almost like a painful process, right, especially with the text because you know, you get it so close. and then you're like, oh, just got to keep generating and regenerating until you get it. I'm curious. I haven't tried this. Have you tried doing a, you know, something like Yogesh was asking there, you know, something with text, but then creating a video afterwards? So if you get a good, you know, image that has text on it and then trying to create a video, I haven't tried that. I'm curious if you've tried it and if it works at all. Yeah, it works.
Starting point is 00:32:21 It works. It depends on it also. So it's easier to create text on the images than to animate the image with a video tool. Because some of the video models tend to have very limited fonts. So for example, I have something like a welcome written on the cloud. this photo, this image is generated with Dali, and then that word looks like the shape of the cloud. And then I can, then I upload the image to runway.
Starting point is 00:33:04 Then easily, these words, the word welcome, can turn into the same word welcome, but in the font of Ariel or in the normal font that you can find in the word document. So that's the limitation there. Yeah. And so, you know, I'm curious. Like, and we'll make sure to share maybe, you know, your favorite project in the newsletter. So everyone listen and make sure to check the newsletter out, sign up at your EverydayAI.com. But I'm curious, yeah, and you like, what is the most time or the most like generations that you've spent on a single project?
Starting point is 00:33:47 And then maybe, you know, talk a little bit about that project. I spent a lot of time at the beginning when I first used General AI, when I first used Meadjury, I spent a lot of time trying different directions and I was on my phone all the time on Discord. But then after a few months, I figured out the methodology and I began to be more efficient. So I rarely spent more time on any single project. Yeah, so I don't really see anything that will be too time consuming. So maybe I can tell you about a bit about a video that I was making at the beginning of this year.
Starting point is 00:34:46 So I wanted to make a five-minute mini-film based on the CGI character of the cats with the story. And that actually took me some time because to make a proper film with a proper story, you need a lot of scenes and a lot of different characters. And the limitation of the AI model is that it can be trained with a lot of data, a lot of image data from everywhere, but it may not cover everything. So some of the things will be almost impossible to create. Then you have to change the story. You have to change the story based on what images you can create.
Starting point is 00:35:34 That's the time-consuming part. That's a good, yeah. Yeah, that's the thing with generative AI, right? Like sometimes you think you're going to get something consistent and, you know, another video that can work really well with the other videos, and then it's going to spit out something completely random that you can't even work with, right? So, Tanya, we've talked a lot.
Starting point is 00:35:55 We've talked about some of your processes. We've showed the audience live, how to kind of roll the dice and work with these different generations. But as we wrap up here, what is your best piece of advice for people out there? Maybe they're new to these AI video programs, or maybe they're just really struggling to get. get good results. What is, you know, kind of like, especially based on your methodology, right, of kind of rolling the dice, what's your best piece of, you know, advice for everyone
Starting point is 00:36:27 to improve their outputs? I think there's one shortcut that we can, that we can all follow. Basically, for any image model or for any video model, you just look at the successful images that people, that other people create. Or you look at their marketing videos. And then based on the marketing video, you'll know what kind of categories, what image categories, what video categories are easier to create. Then you can, then you just double down your efforts into this category that you have a much higher chance of success. If you just focus on the ones that other people already prove that, prove their success. That's great. And, you know, I think this is important because this allows, and I guess one more question, right?
Starting point is 00:37:14 So I think this allows everyone to be more creative, right? Because I think a lot of times when people hear about these tools, mid-journey, runway, etc., they're like, okay, well, that's not my background. And I think, you know, you just gave a great example, you know, going from someone in market research to now, you know, you're a visual artist. So maybe one last question is what ways might you recommend, you know, other people if they find this, you know, technology fascinating? but what are just some good ways that people can use this, you know, across their, you know, businesses or, you know, personal projects? I think the best way is just to get started. Just start with baby steps. Start talking to chat GPT, start creating images, start with very simple prompts, and then build up your process little by little.
Starting point is 00:38:04 That's great. Yeah, I mean, you have to start small and you have to go step by step. Tieni, you thank you so much for joining the Everyday AI show and telling us all and teaching us all how to roll the dice for better AI visuals. We very much appreciate your time. You're welcome. Thank you so much for your time.
Starting point is 00:38:24 All right. And hey, as a reminder, y'all, we're going to be sharing some of his best examples. If you want to watch the video, maybe if you're listening on the podcast, make sure to go to your everyday AI.com, sign it from the free daily newsletter. If this episode was valuable,
Starting point is 00:38:37 please consider sharing it or sharing it with a friend. leave us a rating, but we hope to see you back tomorrow and every day for more, everyday AI. Thanks y'all. Meet Firefly AI Assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere
Starting point is 00:39:08 Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution, stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.