Everyday AI Podcast – An AI and ChatGPT Podcast - EP 514: Google’s AI Studio - 5 time-consuming tasks you didn’t know you can automate

Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. You probably know a lot of Google's AI products and offerings, right?

Starting point is 00:00:52 Maybe you use Google Gemini as a chatbot like you would chat GPT. Maybe you use Google Gemini inside your workspace apps like Google Docs and Google Sheets. Or maybe you use one of my favorite tools from Google Powered. by Google Gemini, Nopo Galem. But there is a legit hidden gem that I think most people don't know about. And that is Google's AI studio. And I think for a lot of use cases, it actually might be Google's best AI AI. And I don't think many people know and most people aren't using it.

Starting point is 00:01:39 And there's been a lot of recent. updates just in the last few weeks that I think make Google AI Studio even better. All right. So we're going to be going over that today and talking about inside Google AI Studio five time-consuming tasks that you probably didn't know you could automate. All right. I'm excited for this one. I hope you are too. If you're new here, what's going on, y'all?

Starting point is 00:02:04 My name's Jordan Wilson and I'm the host of Everyday AI. We're a daily live stream podcast and free daily newsletter, helping us. all not just learn AI, but how we can leverage it to grow our companies and grow our careers. Is that what you're trying to do? If so, it starts here. This is where you learn with the live stream, with the podcast, but where you leverage it and where you actually put it to work, that's on our website. Number one, on our free email newsletter that we put out every single day.

Starting point is 00:02:33 We're going to be recapping the highlights of today's show. So maybe you're driving or walking, you're dog, and you're like, oh, what was that? It's going to be in the newsletter. But also while you're there, we have more than 500 episodes. We've interviewed hundreds of the world's leading experts on AI across a variety of topics. So no matter what you're trying to learn, our website is where you go and make that happen. It is a free generative AI university.

Starting point is 00:02:58 All right. Normally, most days, we go over the AI news. There's actually a ton going on. But I didn't want this one to turn into too long of a show, wanted to keep it very factual, you know, hard hitting. So if you want the AI news, that's going to be. be in our newsletter. All right.

Starting point is 00:03:13 Here we go. Let's just jump into, well, no, I got a shout out all my people here, McDonald, joining from Chicago. Yeah, we're doing this live, y'all. I like to say, you know, at everyday AI, we're the realest thing in artificial intelligence, right? So much of what you see online or listen to or watch, right? It's very fabricated.

Starting point is 00:03:32 It's very scripted. That's not what we're going to do here. We're going to be trying to do a lot of this stuff live. There's going to be some bugs, I'm sure. So thanks for joining us, McDonald. Kelly joining from Syracuse. Dennis joining on LinkedIn from New York City. Love to see it.

Starting point is 00:03:47 Noel joining from Chester, England. Aidan, not too far away from Indiana. Jackie, happy Tuesday. I'll be saying Jackie tomorrow at DePaul. That'll be fun. A lot of people. Got a shout out Dr. Harvey Castro, joining us from Berlin, Germany. All right.

Starting point is 00:04:06 Let's get into the use cases, y'all. And it is Tuesday. So here's my hot take for Tuesday. Google AI Studio is the best AI tool you've probably never heard of, right? If you're an avid AI user, right, you're using large language models hours a day. You've probably heard of AI Studio. But if you're more casual, you probably haven't or you probably at least don't know what it's capable of, you know?

Starting point is 00:04:36 I'm curious, live stream audience. We've got a ton of people in the house today. How often do you use AI Studio or have you never used it? I want to know. I'm also kind of curious because I think most people either haven't heard of it or they don't know what it's capable of. And we're going to be going over those five time-consuming tasks that we can do, that we can, you know, automate.

Starting point is 00:05:01 So let's start at the top. What the heck is Google AI Studio? Well, number one, it looks a lot cleaner than And it did a month ago, right? So a shout out to the Google team. They really cleaned up the user interface and the user experience. And why am I mentioning the user interface and user experience for an AI tool? Well, it is more for developers, right?

Starting point is 00:05:25 So at least initially, Google AI Studio was really made for developers. It's a place where, you know, people who are a little more technical can go in. They can, you know, work on some of their code. They can work on structured prompts. can work on, you know, how they're using Google's API on the back end, right? So, you know, let's say that your company is, is maybe fine-tuning one of Google's models or you're using their new, you know, Gemma 3, which I think is an amazing small model, right? So maybe you're using that, you know, on-device, on-prem, you're fine-tuning it, whatever it may be. But, you know,

Starting point is 00:06:02 I think Google AI Studio was originally just created to make developers' lives easier, right? In the same way that Open AI has their playground, but the difference is AI Studio is like legit, a full-fledged working product. And it's for much more than developers. I actually think the everyday non-technical person, such as myself, I'm not super technical, right? I'm not fine-tuning models aside from just testing it out inside of AI studio. But I think for non-technical everyday people, there's so much to explore, so much you probably don't know, in Google AI. Studio. So yes,

Starting point is 00:06:41 originally kind of set up for AI developers over the last, I'd say six months, it's really grown into so much more. And I think that there's a lot of tools and features that a lot of other companies have kind of promised, but maybe haven't fully delivered on yet. And Google AI Studio has delivered. So is anyone using this?

Starting point is 00:07:02 All right. So Allison says a handful of times. Heather says, heard of haven't used. Angie says barely. All right. Kelly hasn't used it. Jay hasn't used it.

Starting point is 00:07:15 All right. A lot of people haven't. Noel says just heard of it. Dennis is using it weekly. All right. So some of you are using it. Some of you have never heard of it. So buckle up.

Starting point is 00:07:23 We're going to, we're going to go into it. But at its core, originally kind of made to help developers, uh, make better apps with Google's API, right? Because there's so many, uh, tools and service.

Starting point is 00:07:37 out there, tens of thousands that you use that are powered by, you know, either Google Gemini on the backend, Open AI, Claude, et cetera, right? So developers usually have kind of like a sandbox or a playground that they can go experiment and make their products better before releasing it to us all in the world, right? So think of those, you know, all these AI tools. You're like, wow, how do they work? Well, generally they're running on, you know, Google, Open AI or Claude. And developers need a spot to go and play around with it and make it better. And that's why, you know, talking about the UI-U-X, there's a lot more, right? So if you just log on to jemini.com, it's an AI chatbot, right? You're just chatting with Google Gemini. You can still select your model, but for the most part,

Starting point is 00:08:19 you don't have a lot of controls. Inside Google's AI studio, there's a lot of controls. So you can control the temperature, change your model, toggle certain tools on or off for structured output, code execution, function calling to bring in kind of third-part API's grounding with Google search, which is new and amazing. My gosh, it is actually, it got the, uh, the, the top spot on the LAM arena board for internet connected search inside Google AI studio. It's so good. Okay.

Starting point is 00:08:48 So that's kind of a, the gist of it. And here's the other thing. It's free. Yeah. It's free. Uh, some more updates here, but let me put out, uh, kind of, uh, let me read at first what Google says about this, um, kind of on page. services versus paid services because there's an update here that I think people are going to like

Starting point is 00:09:07 because previously you know Google AI studio it just anything that you put in there it used that information to train its models right so and that is still the case for free so if you are on a free Gemini account this is what Google says and how they use your data so it says when you use unpaid services including for example Google AI studio and the unpaid quota on on Gemini and I API, Google uses the content you submit to the services and any generated responses to provide, improve, develop Google products and services and machine learning technologies. All right. So if you are on a free Google plan, you probably shouldn't be uploading confidential,

Starting point is 00:09:50 proprietary data from your company because Google, if you are on the free plan, it does use that to train its model inside AI studio. However, and this was just updated, I believe in. April. So just like this month, this was updated. So now if you are paying for Google Gemini and you have a cloud billing account set up, not technically, you know, it's not hard. It took me like 30 seconds when I originally set up, you know, this, I don't know, a year or so ago. But here's what it says now. So it says now when you use paid services, including, for example, the paid quota of the Gemini API, Google doesn't use your prompt or responses to. to improve our products. So that's an update. I don't even know if Google really announced this and I didn't really see anyone talking about this,

Starting point is 00:10:40 but y'all, I always do my homework before I do these shows. I don't just randomly jump into something. I care about data privacy, security, how hallucinations, truthfulness, how the model works, right? So good thing, Google updated this. So if you're on a paid plan and you set up that cloud billing, Google's AI Studio does not use anything that you input to train its model. If you're on the free plan, it does.

Starting point is 00:11:02 But y'all, let me be honest. So much of the information that you're probably going to put into any chat about it's already on the internet. It's already public information, right? Let's just put that out there. All right. A lot of people don't understand how much of, you know, your company's data is already in large language models. If it's been posted on the internet ever before, it's in the models. All right.

Starting point is 00:11:23 So if you are brand new and feel free live stream audience to follow along as we go, the first time you go into Google's AI studio, you're going to get an option. Okay. And just so you know, it is AIS Studio.gov. Google.com. Okay. So it's going to say it's time to build and you can either use Google AI Studio and click that try Gemini. So yes, it is Gemini inside of AI Studio or you can do build with Gemini API. So if you do want to, you know, you can actually use Google's API key, which a lot of people didn't know you can do. So you can, you know, try it out that way. All right. A couple. housekeeping things to keep in mind. One, if you go up to settings, you know, you can do light mode, dark mode. But yeah, like I said, the user interface used to be a lot more geared toward highly technical people. Now it's easy to follow. It's so much cleaner now.

Starting point is 00:12:16 But you're going to want to turn on the auto save option. So go up to your settings, toggle that auto save on. Otherwise, by default to Google's AI Studio does not save your chats. Okay. Now, let's go into use case number one. long context window. All right, here's where we're going to get a little tricky. And apologies, y'all, for probably jumping around a lot of tabs here.

Starting point is 00:12:46 So what I have is I went through, and I'm going to share this with our live stream audience, podcast audience. This is one of those you might, you might want to watch the video on this. So if you're listening on the podcast, we always leave a link. You can go watch this video, whether it's on YouTube, LinkedIn, whatever. Okay. So I pulled about 50 episodes, transcripts. All right.

Starting point is 00:13:17 So from our AI news that matters. All right. It is nearly a 400 page documents. All right. So I'm going to copy this right here. All right. It might literally crash my computer just from copying some. much information. So why does long context window matter? And why is that my number one use case for

Starting point is 00:13:38 Google's AI studio? Well, you'll see here, even with Open AI's new 03 model, it can't handle this much context, right? So I'm telling it to at the very top of this prompt, I am saying, please carefully analyze this content and find 10 underlying trends that indicate trends from AI companies, large language models, generative AI, etc. So again, I have the transcript of our AI News That Matters shows. I think there's probably like 50 episode transcripts. So that's about 40 hours of transcripts content. So if I go into Open AI's new model, their O3 model, I love it. I think it's one of the most capable models in the world. I think Gemini 2.5 Pro is probably a little more useful right now, but I think O3 is probably

Starting point is 00:14:28 more capable. And I did full shows on each of those in the last two weeks. But I can't even put this full context in here, even with the extended context window of chat GPT's O3 model, right? You might be saying, oh, okay, Jordan, what about Claude? Claude has a long context model. Can't handle this. Can't handle this. So you'll see from a context, this is 250,000 tokens. All right. So more than a quarter million tokens. It's a lot of information, right, on near. 400 pages of transcripts. So Claude can't handle it. It says it's 40% over the length length limit,

Starting point is 00:15:05 so I can't do anything. Even Google Gemini, right, on their front end, because this is one of those reasons you might be thinking, okay, well, why would I ever use AI Studio? Can't I just use all the features and functionality inside Google's frontend chatbot at Gemini.com? Well, no, here's one use case already. I can't paste in all that information into Google Gemini.

Starting point is 00:15:28 It just cuts it off. But I can, obviously, in Google's AI Studio. All right. So let's see, let's see if I can do this live. If not, I do, I do have a screenshot. But let's see. All right. So I am going to copy and paste literally 400, 400 pages worth of context.

Starting point is 00:15:53 So now I am in Google AI Studio. I'm not going to go over the entire interface now because we're going to be doing some of these things later. So I'm going to choose, just so it's a little faster, I'm going to choose Gemini 2.5 Flash preview, which was just released about 12 days ago. So you do have on the right hand side, you have these different models that you can choose from.

Starting point is 00:16:14 And the good thing I love about this inside Google's AI studio, it gives you, so if you are a developer and you want to care, oh, how much might this cost if I put something like this out into production? For each model, you can just hover. and it gives you, you know, thinking input tokens, non-thinking input tokens, what is best for, use cases, the knowledge cutoff. I wish every single model had this. All right.

Starting point is 00:16:39 So I'm just going to go to Gemini 2.5 Flash preview. I'm literally just pasting in 400 pages of content. All right. I'm going to scroll up the top. So again, all I'm saying here at the top is please carefully analyze this content and find 10 underlying trends that indicate trends from AI companies, large language models, generative AI, etc. So I want to know essentially over the last eight months,

Starting point is 00:17:02 everything that's been happening in the world of AI news, what are the trends? I might know some of these things. I might not. And I am using Gemini 2.5 Flash preview, which is a thinking model. But I did use the Flash version, which is the smaller version versus the pro

Starting point is 00:17:16 because I'm guessing the pro one would have taken longer. And my gosh, this is done. That was like, no time at all. 26 seconds. It went through and it used reasoning. My gosh. If you're listening on the podcast, this is one of those ones where I'm just like smiling and I'm like, I can't believe this technology exists. Number one, I can't believe this technology is free. Insane. My gosh. Okay. So, and it did find 10 different trends. I'm just going to read just the headings, but it gave me, it gave me details, right? And the crazy thing is people are always like, oh, Jordan, how do you

Starting point is 00:17:56 remember so much on AI. Well, number one, I forget a lot, right? I forgot most of this stuff, even though it came from my lips, right? But so many of these things were back in like eight months ago. So I forget things. I use Google AI Studio every single day to remember and recall information. So let's just for fun go over the 10 trends. Number one, escalating competition in rapid model responses. And some very good details there. Number two, the rise of agentic AI and task automation. Again, we're looking at October 24 through April 2025. So yeah, back in, you know, October, there really wasn't a lot of talk on agentic AI or even task automation inside large language models. Now there is. Number three, advancement in specialization in

Starting point is 00:18:44 multimodal AI. Four, heavy investment in innovation in AI hardware and infrastructure. Five, diversification and experimentation with pricing in business models. That's a good one. Right. talking about like as an example, Open AIs pro tiers, Anthropic just came out with some new tiers. So yeah, there's details supporting all of this as well. Six, increasing strategic partnerships,

Starting point is 00:19:06 investments and acquisitions. Seven, growing focus on AI safety, ethics and governments in governance. Eight, rising regulatory scrutiny and geopolitical tensions, that's a huge one, right?

Starting point is 00:19:16 And to be able to know just over the course of five to six months, here's what's happening there. That's huge. Nine, transformation of user experience and accessibility. and then 10 significant impact on the workforce and industry structure. This is so good.

Starting point is 00:19:31 And because I used a thinking model, I can go in inside Google AI Studio and look and see how it went through all of this information. So extremely impressive. I can go see how the model thinks, see how it planned, and see how it identified these 10 different trends. And yes, it would have been better if I use Gemini 2.5 Pro, but it probably would have taken another minute or two, and I'm trying to get through these different use cases a little quicker. All right. Yeah, Angie just says, whoa. Yeah, I agree. Curtis from YouTube says,

Starting point is 00:20:09 those little model cards are so helpful. Yes, I agree. It's, I wish, number one, I wish that that was also in the front end of Google Gemini, but I wish everyone did that, right? Those model cards are so, so helpful, right? Especially if you're a beginner. right, and you go down to Gemini 2.0, right? Because there's actually some features that are only available in Gemini 2.0 inside Google AI Studio, not in the newer Gemini 2.5. You have your Gemma models. So extremely, I think probably still the world's best small language model in Gemma 3.

Starting point is 00:20:46 But then you could also go and play with some of these preview models, which is the LearnLM models. All right. So that's number one. You'll see already. That's something that no other platform could take a million tokens, right, in the front end. It just, that's, that's, that's nutty. So think of all the use cases, and I'll tell you what I do for this all the time. So, yes, you can do something very similar in Notebook LM, right?

Starting point is 00:21:14 You know, dump a bunch of contexts in there. And this could fit inside Notebook LM. But one of the reasons is I still like sometimes having flexibility outside of your grounded information. So as an example, and let me just do this quick. All right. So, all right, we're still on use case number one. But there is this new thing grounding with AI search. So now I can use information that's not in that 400 pages, right? Whereas on Google's notebook LM, as powerful as it is, and it literally won our 2024 AI tool of the year. But now I can click this new option grounding with Google search. Right. And I can say something like, you know,

Starting point is 00:21:54 based on April 2025 movements in the AI space, please bullet point what's new for the above 10 trends. So like as an example, if I just wanna see what's happened this month with those 10 trends and to see where they're at in this current day, because some of them are reflecting back six months or more, I can ground this in Google search and then I can go see exactly what it's doing.

Starting point is 00:22:23 So now it's just going to get information from April 20, 25 only. And it can go out and search and use the web. So very cool. All right. Use case number two, video creation. All right. I'm going to try to do this live. Let's see how it goes.

Starting point is 00:22:44 All right. So people don't know. You can create videos inside of Google AI studio. So one thing, I'm technically now starting inside of Gemini advance because I want to get a photo first. So I'm going to, unfortunately, I don't think right now inside AI studio, you can create images, but you can create videos with their new VO2 model. So I am in Gemini right now, and I'm saying create an overhead aerial photo,

Starting point is 00:23:15 realistic shot on full frame DSLR from above a commercial airplane flying over the Chicago. skyline. All right. So now I, again, I am in the front end of Gemini. I don't necessarily like this. This isn't what I was going for. It only, it only got the wing there. I wanted to be able to see the entire, the entire plane. So I just clicked the redo. I already did this once. It was actually a really good photo, if I'm being honest. But I wanted to be able to see the full airplane overhead. If I didn't, if, if I can't get exactly what I'm looking for. I did run this once this morning and I'll use that image, but I was trying to see if I could get a nice one. All right, it did the same thing, but that's fine. So anyways, you know you can pull this into Google AI Studio. All right. So now I'm jumping back

Starting point is 00:24:07 into Google AI Studio. And I'm going to go on the left hand side, you have this new video gen. All right. So VO2 is so good, right? I'd still say even though we have some new updates as an example, runway, Gen 4, we've had some very impressive offerings from some of the Chinese companies, Kling as an example, all the different AI video companies. I still think VO2 is ahead.

Starting point is 00:24:37 It's ahead of SOAR. It's ahead of everyone else. All right. So, FYI, Google is bringing VO2 if you have a paid, version of Google Gemini. And it's a slow rollout because you'll see right now on my different, I have four different paid Google accounts. I don't have V-O-2 access inside of Google Gemini right now. So the Google Gemini chatbot, Gemini.com. But if I go into Google AI Studio, I do have it.

Starting point is 00:25:02 So on the left hand side there, you'll see I have Video Gen. Okay. And the good thing is inside Google's AI studio, there's a ton more option. So on the right hand side, you'll see, I have VideoGen. So on the right hand side, you have all these different settings. So I can create a 16 by 9 video. I can create a vertical video, nine by 16, you know, anywhere from five seconds to eight seconds, I can change the frame rate. I can have a set the resolution or run a negative prompt, which is if something keeps popping up in my AI video generation and I don't want it to, I can write it in the negative prompt box. But the good thing with Google's AI studio and using VO2 there versus, yeah, you can use it in the more enterprise vertex and get even more options.

Starting point is 00:25:43 But the thing I like inside Google AI Studio is you can start with an image because unfortunately, if you're using this inside Google Gemini, the last I checked, you aren't able to start with an image. So this is great. So I uploaded the previous image that I had and I have a little prompt here. So I'll read it out. So all I'm saying is cinematic panning shot showing the airplane slowly moving from right. to left across the Chicago skyline. All right. And here is the image that I ultimately used.

Starting point is 00:26:19 So I don't know. LiveSream bodies, does this look like a real image, right? Google's Imagine, Google's Imagine 3 AI photo generator is really, really good, right? If you look at like LM Arena, which, you know, we might as well just do that now because it's going to be a second for this to generate. But if you look at LM Arena, and if you look at LM Arena, and if you look at LM Arena, and if you you go to the leaderboards and you go to, let's see, text to image.

Starting point is 00:26:49 So this is the different text to image. Like imagine three from Google is the top AI image generator, at least that they have access to inside this program. All right, so let's see here. Hey, this wasn't too bad. Michael said video generating on a live demo. Jordan is brave. Yeah, I know, right?

Starting point is 00:27:09 All right, but it looks like it's already done. All right. So I'm going to go ahead, put this full screen. So we have an eight second video here, live stream audience. Let me know how does this look. Again, so we started with the image in Google Gemini, brought it over to Google AI Studio, using the video gen on the left hand side inside Google AI studio. And this is all for free.

Starting point is 00:27:31 Oh, this is wild. This is wild. All right, here we go. Here's the photo. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational. experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative AI studio. Powered by Adobe's creative agent, Firefly AI assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the assistant.

Starting point is 00:28:04 The assistant orchestrates multi-step workflows drawing on 60 plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks, like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adopi.com. or the video. Not bad. Not bad. So pretty good. There were some, some, some, some,

Starting point is 00:29:01 some, some, some, like, artifacts right here. There's, there's, there's like some little blur here. But again, you can just, you know, you can just regenerate, you know, and the good thing is, it's just using natural language, right? I just said cinematic panning shot, showing the airplane, slowly moving from right to left across Chicago skylight. So, like, what could you use this for? I mean, my gosh, your company's marketing is probably still stuck in the 1990s. It's old, it's dry, install. So let's say as an example, your big annual conference this year is in Chicago, right? If it is in Chicago, by the way, reach out to me. I'll come, I'll come a keynote your conference and run a workshop. But, you know, maybe you're trying to put together a nice looking,

Starting point is 00:29:41 you know, landing page for your annual conference, for your big company's annual conference in Chicago. This would be a great, I think, you know, get this like three second video and put that as a, as a background on the website, right? It's great. You can tell if you're from Chicago, you can tell it Chicago, right, for our podcast audience. It looks really cool. I mean, to be able to get a shot like this, right? You would have to be in another like airplane because you can see the entire airplane

Starting point is 00:30:13 flying over the Chicago skyline. So very cool, right? Is this going to, you know, be in the next Netflix documentary? No. but again, I essentially just, you know, two shot at this, right? I took two attempts at it. It's really good. It's really good.

Starting point is 00:30:33 Yeah, live stream audience, what do you think of this video? I think it's like in terms of being like putting up on your website as a background video, you know, putting it on social media, right? You could throw it in Canva, something like that, add some effects, add some words. In terms of you now have this used to take so much time, right? To get a video like this five years ago would be nearly impossible and extremely expensive. Now, in terms of your marketing, your comms, your advertising, using Google's VO2, especially if it's just going to end up, like, as a smaller something on your website, social media, et cetera.

Starting point is 00:31:13 Again, I don't think you would want to use this at least right now, right, in a feature film or something like that. But it looks extremely realistic, especially on a smaller screen. if you're only doing a couple of seconds, it's very good. And where this takes your creativity, like next level, next level,

Starting point is 00:31:30 yeah. Angie just says, I can't believe it's free. Jay says, actually at an annual, oh, Jay, I suck. I know you texted me.

Starting point is 00:31:37 I got to get back to you. All right. So Sandra says, I'm actually doing it on my computer as we're speaking, and it's super amazing. Yeah, it's so,

Starting point is 00:31:46 so good. I think the physics and the realism, specifically the physics, is what separates V-O-2 from Google. ahead of SORA. SORA is great, but it struggles with physics. I think there's some creative tools inside SORA from Open AI that I really like, but it struggles with physics. It struggles with realism. Google V-O-2 crushes it. It is absolutely so good. Yeah, Denny says good for background image that might have an overlay. Exactly. I would put this if I was having a big conference in Chicago,

Starting point is 00:32:17 right, and you're selling tickets or whatever it is, you know, you try to bring thousands of employees together for whatever, right? Or just, you know, maybe your, your website needs a refresh. Go go do it like this, right? It's really good. All right. So that is use case number two. Let's go to use case number three.

Starting point is 00:32:34 I've done this before, uh, but parsing PDs. Okay. And I know this might sound like a boring one and not super impressive. All right. But it is. Uh, because again, up until, uh, the most recent line of models from both Google and from OpenAI. This was something that large language models really struggled with, right?

Starting point is 00:32:59 You know, using a combination of OCR, all right? So that is OCR is optical character recognition. So, you know, these models would use a combination of OCR and computer vision and it wouldn't always work. All right. So let's go again. We're doing this live. I did do this one other time, but I want to include this.

Starting point is 00:33:21 on today's show because I think it's extremely impressive. All right. So let's jump over. We're doing this live. We are in Google's AI studio. All right. There we go. And all I'm going to say is transcribe this PDF word for word.

Starting point is 00:33:39 All right. And for this one, I am going to go with Gemini 2.5 Pro, the most powerful model. All right. I'm going to click run and we'll see how long this takes. And first of all, you might be thinking, okay, why, why might I want to use this? All right? How many times have you gotten a super long PDF? Um, and maybe you have to make updates or you're like, my gosh, I have to read through

Starting point is 00:34:03 this entire thing. Uh, and it's, it's going to take forever. And maybe I just need a little bit of information out of here. Uh, so in this case, and a lot of times, different tools struggle reading PDFs, right? Especially, uh, you know, I'll show you all an example of, of, of what, uh, you know, this PDF is going through and you'll see why this could be extremely helpful, right? So this is going through. This is kind of our sponsorship opportunities deck.

Starting point is 00:34:33 So if you look through this, it's images, right? Any tool would struggle to read this. I created this in Canva. There's a bunch of images. A lot of this stuff that looks like text isn't even text, right? It's literally a bunch of images. any program would struggle. So, you know, when companies reach out and they want to advertise with us, you know,

Starting point is 00:34:55 I send them this little deck over. So I want to pull up this page. So at the bottom, it says trusted by leaders from, right? So we have, you know, hundreds of, you know, subscribers from big companies like, you know, Google, Amazon, IVIDia, Microsoft, IBM, Intel, et cetera. So we have their logos at the bottom of this deck to tell people, yeah, if you want to reach our audience, you have a lot of, you know, very reputable people who are building the future of AI.

Starting point is 00:35:20 you know, um, reading our platform. All right. So that's a logo. Okay. So now it's done. All right. Uh, so let's see how long that took. Didn't take long.

Starting point is 00:35:33 So again, you can go in if you're using these thinking models, these reasoning models, you can go through. It took 53 seconds. All right. And here's the thing. It got it done perfectly. Perfectly. It got all those stats.

Starting point is 00:35:46 It pulled all this information. Uh, it pulled all this information. from, you know, images that would be very hard for any OCR or computer vision to do, right? So in a lot of these pages, it's just like 10 images and maybe a little bit of text. So right here, it even pulled the logos. These logos are not labeled. Let me repeat that. These logos are not labeled.

Starting point is 00:36:12 So, you know, as an example, you know, the Amazon logo. it's just an A. It's just an A with the little, you know, smiley face, right? If you know Amazon's logo, you probably do. It literally knows that that's Amazon's logo, even though you, the word Amazon isn't on there. So do you see the utility and how this can be a huge time saver? So maybe you have to update a PDF and you're like, oh, my gosh, who has that original file? I'm going to have to read you all of this.

Starting point is 00:36:41 Maybe you need to read information from a PDF. And it's a long, you know, 100 page, you know, industry white paper, but you only care about 10% of this. Well, Google AI Studio is a great place to do that because when you combine this very large context window with the fact that it is one of the smartest and most capable models in the world, I mean, you have a tool here that is almost unfair, right? And I'm not even going to get into the more technical aspects of what you can do. But you can even, if you wanted to, you could create like a version of Google gems, right? Or, you know, if you've used custom GPTs inside of.

Starting point is 00:37:18 of chat GPT. So I could save, let's, let's take that industry white paper example. Let's say every single week, there's a huge industry PDF that comes out. It may be a big part of your job is reading through it, pulling out some of the key insights, and then briefing your team, and maybe only 10% of it pertains to you, right? That's probably something a lot of you do, but you could essentially go in here. You could create essentially a version of a Google gem. It wouldn't be called that, but you can put in custom instructions. You can also go in here if you wanted to get a little more technical.

Starting point is 00:37:52 You could even turn on the structured output. So maybe, hey, in each week, there's always a table or a chart. And maybe there's eight columns, but you only want three of the columns because that's all that matters. You could go in. It's very easy, right? You could essentially give examples to Google AI Studio and then create that. And then, you know, it would save because you can save. because you can save it with a system prompts,

Starting point is 00:38:17 and then you can toggle the option on for structured output, right? So I can go in here, I can go into this visual editor, and I can add a property. So, you know, and then you can export this as what's called JSON. All right. I know that sounds a little technical, but essentially you can, you know, kind of save different types of prompts and train it on input outputs.

Starting point is 00:38:44 But you can just do it. it in natural language. All right. What do you guys think about use case too? Richard says how about had that been written text in a PDF? It would have been done much faster, much faster. Yeah. Curtis says it'll even accurately extract text from a PDF that has wonky OCR text in it.

Starting point is 00:39:08 Yes. It is so good. Allison says, I pay for acrobat pro and still receive this error every. time I have I haven't used OCR if there's a single rasterized image. Yes, exactly. It says, oh, yeah, acrobat cannot run OCR as the page contains renderable text. And then she says, AI studio just did this successfully in five seconds. Yeah, it's, it's nutty, y'all. This is so good. Are you guys, again, this is free. The fact, this is free and we're only halfway through our use cases. That was number three. All right. Number four, multimedia content creation. Okay. So, this is one

Starting point is 00:39:44 of those. I'm going to go ahead and jump over into my screen. Again, podcast audience, we're doing this live if you want to see. All right. So for this, for this example, I'm actually going to use an older model. All right. And I do think and hope that they're going to update which models have this capability. So for right now, I'm going to go to my model selector. I'm going to go to Gemini 2.0. and then I'm going to go to this flash image generation. All right. So there's actually a lot of things that I wasn't going to do this, but I saw someone in the comments had asked for it.

Starting point is 00:40:21 So I'm just going to do an example of this. This isn't even one of my use cases. But let me see here. I'm going to try to give me a second here. All right. I'm going to try to upload an image. Let's see. Give me a second, y'all.

Starting point is 00:40:38 I know this is doing these things live. Never fun, never fun. All right. Let me, okay. So I'm going to pull in this same photo that we used. Okay. So this new, you know, and it has the word hot on it, right? If you hover over.

Starting point is 00:41:00 This is essentially, you know, we've been hearing a lot about the GPT40 image gen. Amazing. Right. So Gemini 2.0 Flash, the experience. experimental version and it has image generation. This isn't my use case, but someone asked for it. So I'm going to see if I can do it here live on the fly. So I uploaded that same Google Gemini image of the plane flying over the Chicago skyline.

Starting point is 00:41:26 So let's just say, I'm going to try this. I'm going to say keep everything else the same and turn the and I'm going to say, but make the plane bright yellow with. the words Jordan air on the side. All right. I don't know if this is worth, if this will work, hopefully it does. But you know, I've been mentioning on this show with between what it's done already.

Starting point is 00:41:55 All right. It didn't do a great job. It actually added an additional plane. So I probably should have done, I probably should have done this little better. I'm going to say, uh, there should be only one play. All right. We'll see if that, uh, if that works.

Starting point is 00:42:11 So I've been talking about on the show between GPT40 ImageGen, between this new thing from Gemini 2.0 Flash, whatever, like everything that, you know, Canva announced. I think there's going to be a lot of pressure on Adobe because this is like, if you get it right, this is like Photoshop. This saves so much time. All right. So there we go.

Starting point is 00:42:37 It did it correctly. It changed the perspective a little bit. it made the plane slightly larger. So let's even say that took five seconds. I'm going to say, you know, the plane is too large, make it more like the original image. Keep the one plane yellow with Jordan Air on the side. All right. If I was taking a little bit more time, I think I could get this really well.

Starting point is 00:43:04 But y'all, I can't tell you, right? I've been using Photoshop since how old am I? I'm a little old. I've been using Photoshop for 20, 23 years, almost a quarter century. I've literally wasted thousands of hours that now you can get a pretty good job done inside Google's AI studio. All right. So that one didn't work perfectly.

Starting point is 00:43:28 It added some fragments of another image. That one was a little complicated, but go in and play with that. But that's not even the use case. All right. So I got distracted. So let's get back to the use case, which in this case, it is. doing multimedia content creation. So again, I'm inside Google's AI studio.

Starting point is 00:43:46 On the right hand side, I chose Gemini 2.0 Flash. And you'll see the big thing here is it has this image and text outputs. All right. So all I'm going to say is, you know, write a blog post on the top five tourist spots in Chicago and create photos for each of them. All right. That probably just to type that, FYI, probably talk. took me 12 seconds. Let's see how quickly Google Gemini can create this. So, all right,

Starting point is 00:44:17 it's at, it's, let's see, is it done yet? Not yet. Okay. So you'll see, it's literally writing me a blog post, but it's giving me the top five must see spots in Chicago. But for all of those five in line, it's giving me photos. It is using imagine, which I'm actually curious if this is using Imagine 2 or Imagine 3. My hunch is it's using Imagine 2 because I've run these exact same prompts inside Google Gemini with the Imagine 3 and in the Imagine 3 interface. And the quality is a little better. So I'm not sure.

Starting point is 00:44:58 I'll try to get that verified, but I believe if you're using this, I think it's using Imagine 2. But regardless, I mean, y'all, look at these photos, right? The fact that I just, quote unquote, in 23 seconds, right, I have a blog post, let's say you're a tourism company and you have a new tour here in Chicago with the five, you know, the five, you know, hottest spots in Chicago for tourism. The blog post is written. The images are there, right? Are these the most high quality images? Absolutely not. But they're really good, right?

Starting point is 00:45:33 Hey, another secret here. And what do you think? what do you guys think of this this right here i don't think any other large language model any popular one by default anyways can do this right literally i just in theory quote unquote wrote i would want to go through human loob spend more time on the front end spend more time on the back end but i have a blog post here with five photos that google gemini generated i used to do a lot of you know SEO content creation and i've been doing that on the web probably probably since, I don't know.

Starting point is 00:46:08 I mean, I built my first website in the 90s. I was paid to build some of the first websites I did in, what, like 2010. So I spent a good three, four, five years, mainly doing this, blog posts with images for a lot of clients. It just did it in 20 some seconds. And it's really, really good. All right, here's a little cheat code, right? Let's say you accidentally go into the wrong mode.

Starting point is 00:46:34 So let's say you go into Gemini 2.5. Pro and you run this exact same check. Okay. It's obviously not going to output images, but what's really cool because it's a thinking model and it's like, yo, I can't create images. You'll see this because I always like to test this. I'm like, okay, what happens if you, you know, ask a model inside Google AI studio to do something that it is not capable of? And I can obviously read its kind of chain of thought to see how it's processing this. Let's see if it did it.

Starting point is 00:47:12 Here we go. So what it does is it's like I can't generate an image, but it gives you a very detailed prompt. And then I can go into and then I can go into Google Gemini and just run that prompt. So I just said, create an image four. And I copy and pasted the image description that it gave me. Google Gemini 2.5 Pro realized, hey, I can't create an image even though you asked me to. So instead, here's an image description that you could put inside.

Starting point is 00:47:47 And yeah, this one, again, this photo is much better than the one that was generated inside the multimodal version inside Gemini, Gemini 2 Flash. So I do think or I assume it's a better image generation model. All right. We got more, y'all. But I made a mistake. So I made a mistake. There's actually an extra one that I wanted to do that I think is better than the rest of these. I know there's still a couple between all our different platforms.

Starting point is 00:48:26 There's more than 100 people that are still around. So if you want me to get to the sixth one, just say the word bonus. All right. sometimes I like to have fun, see who's still here paying attention. So if you want me to do the sixth one, because actually the sixth one, initially I said, oh, here's five time consuming tasks. And then this morning when I was going over my show, I'm like, wait, I didn't put the newest and most powerful one in this list.

Starting point is 00:48:51 So if you want me to do the sixth one, just type the word bonus. If we get 10 people, I'll do it. If not, maybe I'll share about it in the newsletter. All right. So just put in the word bonus. If I get 10 people. All right. So here we go for number five.

Starting point is 00:49:03 This one might get a little tricky. So I'm sorry. Also, live stream audience, aside from typing in the word bonus, if you want to see this, I'm going to need some help on this because I want you all to hear this one. Because number five is learning any new skill via share screen stream. Okay. And this is one of those. This is one of those that, you know, open AI previewed this almost a year ago.

Starting point is 00:49:31 And we still don't have it on desk top. But right now, Google Gemini, again, for free, can see your screen and you can interact with it. Let me repeat that. For free, Google Gemini's AI Studio can see your screen and you can interact with it, all right? Yes or no, be honest, y'all. How many of you knew that? How many of you all knew that? All right.

Starting point is 00:50:01 So I'm going to get something going here. I hope that you can hear it. I'm going to have to move my mic. I've done some testing with this. Even when I share, even when I share the audio, you can't always hear it. So number one, live stream audience, can you still hear me? All right, because I move my mic away. All right.

Starting point is 00:50:22 And then I'm going to go ahead and stop sharing this. And I'm going to go over. I'm going to go over. this use case, which I think is amazing. All right. So give me a second, y'all. Doing this live, it's dangerous. It's dangerous here.

Starting point is 00:50:44 Okay. So let me first describe what I have. Okay. I have a simple spreadsheet from my Google search council data. And of the 10 pages with the most clicks, I exported that and put it into a Google sheet. Okay. And what I want to know is I want to know the difference in impressions over the course of the last seven days. All right. So did this go up 10%? Did did it go, you know, down 30% right in terms of impressions? So these are some stacks that I'm pulling in. And so essentially what I want to do is I need to write a formula. Okay. And as much as I love Google Sheets and Excel and everything else, does anyone struggle with formula? just me, anyone else.

Starting point is 00:51:35 All right. So now I'm going to go into Google's AI studio. All right. So that is the sheet I am going to share. All right. And hopefully you can hear this. So you might not be able to. I apologize.

Starting point is 00:51:51 I'm cranking my volume all the way up. So here we go. So on the left hand side of the screen, I'm clicking the stream icon. So this I can just talk to, Gemini live. I can show it my webcam so we can see me or if I was trying to explain something because this also works on mobile, which is really, really cool. Or I can click ShareScreen. So this is what I'm going to try to do. And again, this is something that opening I demoed, I think May of 2024,

Starting point is 00:52:23 and we still don't have it on desktop. All right. So I'm going to click Share Screen. Hopefully you can hear. So I'm clicking Share Screen. And then I have to choose. I have to choose which one we're going to do. Hey, Gemini, can you hear me? Yes, I can hear you. How can I help you today? Cool. Can you tell me what's on my screen? On your screen, I see a Google Sheets spreadsheet titled EDAI Last 7. The spreadsheet contains data organized in columns such as top pages, clicks last seven days, impressions last seven days, impressions last seven days and difference. Do you want me to describe any of the data in more detail?

Starting point is 00:53:08 No. So I understand the data. So in this column here, H2, I want to write a formula that shows me the percentage change in this impressions difference between the, you know, column E2 and F2. So G2 gives me the number, but I want to turn that into a percentage. And I'm not sure the formula that I need to write in H2. So can you tell me the formula? To calculate the percentage change in cell H2, you can use the formula equals G2E2.

Starting point is 00:53:52 After entering the formula, format the cell as a percentage to display the result as a percentage value. Will that work for you? All right. Yeah, cool. Give me a second. So I'm going to highlight. So you just gave me this formula here inside the Google AI Studio chat. So I'm going to copy that.

Starting point is 00:54:09 Give me a second here. And I'm going to go over. I'm going to insert it into the formula bar. Click Enter and there we go. Okay, cool. So now it gave me like a fraction. So that's fine, but I wanted to show up as a percentage. How can I change this new column, the H column, to a percentage?

Starting point is 00:54:35 I understand that you want to convert the values in the H column to percentages. To do that, select the entire column by clicking on the, the column header. Then, in the toolbar, find the format menu. Within that menu, there should be an option for number or format as. Choose the percent option from the drop-down menu. This should format all the numbers in column H as percentages. Does that make sense? Cool. All right, did I do that right? Can you look? Did I get it correct? Yes. The H column now displays the values as percentages. You did it correctly. Is there anything else I can help you with.

Starting point is 00:55:14 Yeah, maybe just out of curiosity. And if you could be just very direct and blunt and act as kind of an SEO strategist. So, you know, these are all website pages from my, my website and some Google Search Council data. What should be some of the first things if I want to increase traffic to my website by looking at this screen? Where do you think my best opportunity is or what should I go do right now? From the data, it appears that the web page, free chat GPT versus chat GPT plus what's the difference, is experiencing a significant decrease in impressions and clicks. To increase traffic, you should investigate why that page is experiencing such a large drop.

Starting point is 00:56:05 Could you tell me about any changes you've made to this page recently? All right. So now what I'm doing is I stopped, I stopped that. That's, what do you guys think? Was that pretty impressive? All right. Angie just said mine alone. And the good thing is, which I didn't have this enabled. I could have done grounding with Google search, right? And then it could have planned that episode for me, right?

Starting point is 00:56:35 Which is really, really cool. So do you guys understand the magnitude of that? I just gave a simple example, right? Okay. I would have probably, you know, that was a very easy, you know, formula to write. Let me be honest, it wouldn't take me that long. But maybe if I was trying to do something more advanced, probably would have taken me, I don't know,

Starting point is 00:56:59 five, 10, 15, 20 minutes to go find something, right? When I can give Google's AI studio and Google Gemini access to my screen, I can become an expert in anything, especially with this new option that no one's talking about, that you can ground with Google search. Because at the end, it correctly identified that one of my, articles on my website, which brings in the majority of the traffic, it's losing ground. And the reason why is, well, that article is a little old.

Starting point is 00:57:32 So it's probably about six months old. And there's a lot of things in the new chat, GPT, free versus, free versus paid that have changed. So it's losing ground. So if I would have had grounding enabled, I could have talked with Gemini to help me plan that update. It could have looked at that webpage, said, okay. here's some new updates that have happened with chat gpt that are not in your article yet so all of a sudden if you use this correctly think of it think of like you now have the smartest

Starting point is 00:58:05 person in the world over your shoulder you can ask them anything anything and they're going to get it so you know you might be trying to learn a new piece of software right maybe it's something like I showed you, an Excel sheet, maybe you're coding something. Maybe you just want feedback on a design, right? And it's actually really good. So you do need to do a little bit of prompting. Sometimes the live talking models are little verbios. So I always like to tell it to be direct, you know, to never be wishy-washy or politically correct. Always, you know, give me hard hitting and actionable advice because sometimes, and that's all large language models, they might give you a bunch of fluff. So you can still do a little bit of best practice prompt engineering

Starting point is 00:58:49 and really kind of, you know, give it a role, give it a task, you know, have a conversation with it, iterate, evaluate, et cetera, and you're going to get great results. But you can literally learn anything, anything new that you're trying to learn, you can do it. This, I don't know why more people aren't talking about this and using it, especially the fact that you can now combine it with this grounding with Google search, y'all, y'all. Angie says, I'm going to spend all day doing this. McDonald said, I did not know that. Fred, good question.

Starting point is 00:59:27 Fred says, can it make the changes you requested to the spreadsheet without you having to do them? So I don't think so. Although, Fred, strangely enough, I ran this simulation a couple of times. Now, one time Google Gemini did say, hey, do you want me to put that formula in there. And I said, yeah, sure, it didn't do it. But, you know, maybe. Monica said, I fought with a spreadsheet for hours last week. So yeah, maybe now you can go fight less. Big Bogey says, how is this free? Yeah, good, good stuff. So yeah. All right. Let's see. Did enough people want the bonus one? Or should I end this? Let me see. How many people said bonus? One, two, three, four, five. I don't know. Maybe. Let's

Starting point is 01:00:11 see. All right. Well, at least let me stop sharing this screen. Take a drink. We're going to go. Angie, Angie says next prime prime polish with Gemini. Maybe. All right. I think I think we got 10. I was trying to count. I won't be a strickler, but okay, yeah, we got. Okay. Now that I'm scrolling up, it looks like the word bonus showed up way more than 10 times. All right. I know it's already been an hour, but let's, let's do it. Here's number six. In this one, y'all, is even crazier. All right. I am not kidding. All right. Let me get my, gosh, where's my link there? There it is. Okay. All right. So give me a second. Number six is better YouTube learning. Stick with me here. it's more than you think.

Starting point is 01:01:11 All right. So what I'm going to do. All right. So I had this video. When was this video? Give me a second. Okay. This video was from seven months ago.

Starting point is 01:01:25 So it's a little old. It is a nine minute video. And it's going over. Oh, gosh. Looking at my hair on this video, I need a haircut right now. Looks like I just got a haircut there. So it's a nine minute video and the video is Google's notebook LM, the AI tool you can't ignore. So I think this was my second video that I did on notebook LM.

Starting point is 01:01:51 Okay. So all I'm doing here is I'm copying the URL. This is a nine minute video. Okay. So now I have my screen sharing going on here. Okay. So this is new. This is new.

Starting point is 01:02:04 So at least using it in the Gemini 2.4. So one thing I didn't talk about is there is this little plus button. If you go over in the middle by the run button. Okay. So if I click the plus button inside Google AI Studio, there's different options where I can start with different media. So the other amazing thing about Google AI Studio, it's multi-modal by default, but it can ingest video.

Starting point is 01:02:37 but not in the way that you think. And let me show you that and let me prove that. So again, when you think of this large context window and the fact that Google's AI studio can technically see this, let me not exaggerate here. Actually, I'm not going to say anything. I want you all to see it live first. So I can click, when I click that plus button,

Starting point is 01:03:00 I can click my drive, upload file, record audio, camera, YouTube video, sample media. Okay. So all I'm going to do here is, I'm going to click YouTube. I hope this works. And then a box pops up. It says YouTube video URL.

Starting point is 01:03:12 I'm going to paste it in there. It loads the thumbnail, which these thumbnails are silly. But someone from Nvidia told me that my YouTube was lame and I had to make it better. So there's a thumbnail, I guess. So all I'm going to say is what do we want to say here? Give me a second. So I'm just going to type in. So this is a video about notebook LM.

Starting point is 01:03:36 So I'm going to say bullet point, bullet point the steps on how to use notebook LF. Okay. And then I'm going to click run. And you'll see right here, it gives me a token count. So this nine minute video is 157,000 tokens. So I'm going to click run prompt. Okay. And again, I'm using Gemini 2.5 pro.

Starting point is 01:04:01 So presumably this is going to take a little longer. All right. So Jackie says Google owns YouTube. So this makes sense. Will it work with other video channels? Great question as we wait, Jackie. So yes, you can also just upload a file if you have a video file. Now, as we wait for this, and I'm going to go ahead and click this.

Starting point is 01:04:26 And you can see in the chain of thought, Gemini 2.5 Pro is thinking it's going through. It's crunching this video. And it's almost already done. Okay. This is good. Okay. So yeah, it's already done. So it took 42 seconds and it gave me bullet points on what I talked about in this video. So number one, access notebook LN, go to the notebook LM interface, create a new notebook. Add your sources. Process the source. Locate audio overview. Um, generate the conversation. Okay. So pretty cool. So it just broke down a nine minute video, uh, into bullet points. All right. And I probably, I'll get what you're saying like, okay, Jordan, what's the big deal? Why is this changing how we work? Okay? Let me, let me do something else.

Starting point is 01:05:22 So I'm typing, describe the person in the video. Hmm. Because there's so many AI tools that you can throw in a YouTube link and, you know, what it essentially does is it goes through the transcript. Okay. And then it's like, hey, according to the transcripts, here's, here's what, here's oh gosh hey at least this okay whew all right that's that's good so y'all look at this based on the video here's a description of the person appearance he is an adult male appearing

Starting point is 01:06:00 to be Caucasian likely in his 30s or early 40s out google outch uh hair uh let's see all right hair, he has dark brown or black hair. My wife will love that one. We always argue about my hair color. I like to pretend that my hair is brown, but it's probably black. Hey, and at least Google was a little nicer here, and it didn't mention all of my gray hair. But facial features, clean-shaven, attire. Look at this.

Starting point is 01:06:33 Attire. He is wearing a dark color, likely black or dark navy, short-sleeved polo shirt. The shirt has thin white stripes along the edge of the collar. A small light colored logo appears to be CK, visible on the left side of his chest. Y'all, look at this. You can barely even see that, right? So apparently I have a, looks like a Calvin Klein polo right there. That little CK is tiny.

Starting point is 01:07:02 But do you understand and realize what is going on here? Okay? Think of how this can change the work that you do. Let's say, I don't know, for an easy, an easy example, let's say you work in a factory. Okay. And, you know, whatever it is, you process, let's say you make these mugs. Okay. I have a glass mug here from a espresso.

Starting point is 01:07:33 Like, Nespresso, I've named dropped you like 50 times. Why do I not yet have a lifetime subscription to Nisproso, right? Let's say your company makes these Nespresso mugs, right? And you have video that is monitoring and let's just say it's a 30 minute video. All right.

Starting point is 01:07:56 And a lot of things go wrong during this 30 minutes, right? You're doing a tour, some things knock off. One of the pieces of the machine, mouth functions, right? A couple of the glasses come out wrong. Whatever. I'm trying to give you an easy example on something that I saw. Google Gemini can go through

Starting point is 01:08:13 and do all of this and say, hey, how many glasses fell off the conveyor belt? Give me the time and then give me a classification on what caused it to fail, right? And then remember, you can have these structured outputs as well as system prompts. So let's say during that 30 minutes, 100 glasses fell off and, you know, a thousand went through successfully. So, you know, it was a 10% failure rate. So you could go in there, create a system prompt using structured outputs, where it essentially kicks out, you know, a spreadsheet looking output that gives you the timestamp, what happens,

Starting point is 01:08:58 and it can classify it as well. Was it human error? Was it a machine error? Was it, you know, faulty glass, whatever? You can create those classifications. Google's AI studio can go in and essentially watch 30 minutes. of, you know, let's say it's security video, you upload it there. My gosh.

Starting point is 01:09:19 Live stream audience is no one's, is no one's mind blown? Anyone else? You know, we just have some people saying, this is crazy. Y'all, think of the new possibilities and just how this changes the future of work, right? A lot of people, if you are in a role where you're observing, something all day and you have to report back, you have to classify this information, y'all. Google AI Studio does that, right? Can you upload an eight hour file right now?

Starting point is 01:09:55 Probably not. But it's getting, it's getting there. Right. So it's even describing my demeanor in this video. It says he appears engaged in expressive as he talks, looking directly toward the camera slash viewer. He smiles and shows enthusiasm while disgust. the topic. My gosh. Like the fact that this technology is available today, it's available for free,

Starting point is 01:10:24 and it can completely change the way you do business. Period. That's a wrap, y'all. All right. Let me quickly recap our six time-consuming tasks that you didn't know you could automate or run inside Google AI Studio. Number one, long context window. Number two, video creation. Number three, parsing information from PDS. Number four, multimedia content creation. Number five, learning any new skill via screen, share screen, stream, and our last bonus one, better YouTube learning and just being able to literally have a large language model, be eyes for any video, and to be able to class classify information, much more than a transcript. It can see and use a brain.

Starting point is 01:11:22 It's wild. That was a wrap, y'all. I hope this was helpful. All right. But there you have it. If this was helpful, please share this. I would appreciate if you share this. If you tag someone that needs to know this,

Starting point is 01:11:38 I think Google's AI studio is getting absolutely slapped on. Like I said, a couple of these updates. I don't think what I just showed you, the ability to upload a YouTube video in Gemini 2.5 Pro, I was using prior models before this. I think that's brand new, right? And again, it's not like the thousands of other AI tools that do that, that just look at the text, they look at the transcript, they look at what was said, they look at the words. No, this can see, it can use its brain, and it can use the Gemini 2.5 Pro model. So if this was helpful, please let me know, repost this, share someone.

Starting point is 01:12:16 tag someone that needs to know this in the comments, y'all. I'd like to make everyday AI a place where we can learn together, where we can share. But you got to do your part too. You got to share. You got to give me feedback on what's working and what's not. So I appreciate you listening. I know this was a long one. If you're on the podcast, thank you for sticking with us.

Starting point is 01:12:33 Like I said, this might be one of those ones where you need to go, click the show notes, go watch this video. It'll hopefully explain things a little better. So thank you for tuning in. Go to your everyday AI.com. If you haven't already, sign up for the free daily newsletter. We're to be recapping this one. It's not going to take you an hour plus to get through the newsletter.

Starting point is 01:12:51 It'll be about seven minutes for you to become the smartest person in AI at your company. Thanks for tuning in. Hope to see you tomorrow and every day. For more every day AI. Thanks, y'all. Meet Firefly AI Assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest,

Starting point is 01:13:19 orchestrating multi-step workflows across Adobe Creative Cloud apps. including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating.

Starting point is 01:13:52 It helps keep us going. For a little more AI magic, visit Your EverydayAI.com. and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

Everyday AI Podcast – An AI and ChatGPT Podcast - EP 514: Google’s AI Studio - 5 time-consuming tasks you didn’t know you can automate

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.