Everyday AI Podcast – An AI and ChatGPT Podcast - OpenAI’s new ‘12 days’ features: What’s here so far and what it means

Episode Date: December 12, 2024

o1 Pro? Sora? Apple Intelligence? Canvas coding? Huh? We're only halfway through OpenAI's '12 Days of OpenAI' yet everyone's favorite AI chatbot has already gotten a facelift and a half. Can't keep up? Confused with what's been announced so far? Join us live, and we'll break it all down.

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Ask Jordan questions on the OpenAI updates
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:
Expansion of OpenAI's Reinforcement Fine Tuning Research Program
Updates to Canvas in OpenAI's ChatGPT
Google's AI advancements including Gemini 2.0 Flash
Google's deep research tool and its market implications
Day 1 of OpenAI's 12-day feature release event
o1 model and ChatGPT Pro announcement
Predictions on further premium features in AI plans
Canvas and ChatGPT integration
Siri and ChatGPT integration in Apple's devices
Discussion on Sora, OpenAI's text-to-video model
Limitations and the future of Sora's availability
Account tiers and upgrade limitations in OpenAI's offering
Competitive landscape: OpenAI, Google, and Gemini
Opinion on Google's market strategy
Personal user experience with Google's deep research tool
Predictions about OpenAI's upcoming releases

Timestamps:
00:00 Google dominates AI news with significant updates.
04:46 Gemini 2.0 powers AI for web interactions.
06:55 Google's Astra app: multilingual, memory, screen sharing.
12:55 o1 excels in data-intensive fields.
14:17 Premium AI plans will cost hundreds monthly.
16:58 Using large language models for initial thinking.
21:19 Affordable AI simplifies costly advertising campaigns greatly.
24:29 AI storyboard improves video coherence control.
28:01 Simplified model tuning clarity over cost, complexity.
30:38 Sponsor show for coffee mentions; Canvas updates.
34:25 Canvas now available in GPT for free users.
36:16 Interactive writing and coding with ChatGPT Canvas.
41:12 Prefers Google voice over Apple Siri/Alexa.
45:09 Google Gemini front end gets crucial updates.
48:41 OpenAI to release ChatGPT projects and voice updates.
49:28

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Start Here ▶️ Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com. Also, here's a link to the entire series on a Spotify playlist.

Transcript
Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. If you want new AI features, this is the show for you.
Starting point is 00:00:51 My gosh. So today we're going to be going over what's new and what's been announced with OpenAI's new updates. They've had this 12 Days of Shipmas, or 12 Days of OpenAI, and we're just about in the middle of it. So today we're going to be recapping not just what's new with all of these updates, but what they mean and how you can actually use them. So I'm excited for today's show. I hope you are too.
Starting point is 00:01:22 What's going on, y'all? My name's Jordan Wilson, and I'm the host of Everyday AI. So this thing, it's for you. It's your daily live stream podcast and free daily newsletter, helping everyday people like you and me, not just learn about AI, but how we can all actually leverage it to grow our companies and our careers. Is that you? If so, this is for you.
Starting point is 00:01:44 Another thing that's for you, our website, my gosh, you got to go to it. It's youreverydayai.com. There you can listen to anything, right? Whatever you want to learn in generative AI, we have it there. We've talked to the world's leading experts on just about everything. You can go watch, read, and listen to every single episode, 420 plus, as well as go sign up for our newsletter, where we recap each day's show as well as bring you everything else that you need to know to stay up to date.
Starting point is 00:02:13 All right. So today's show is going to be a little different, a little special, I think, right? Because every day we start off with the AI news. And today, actually, Google shipped so much. Today's episode, I mean, it's going to turn into a Google slash OpenAI update. So yeah, we're going to go over the 12 or at least, you know, the first five days of OpenAI's updates. But going over the AI news today, well, Google's
Starting point is 00:02:40 got it on lock because they announced, I think, let me just say this out loud. I think Google announced more in the last 24 hours than OpenAI has in the first five days of their kind of 12 Days of OpenAI. So we're going to start off the AI news, which is literally just Google updates because I think they're that big and that significant. Also, let me know, live stream audience. I had some issues last time. Hopefully y'all can hear me today. All right, but let's talk about AI news today. This is all Google. My gosh, they shipped. All right. So first, Google has unveiled Gemini 2.0 Flash, the latest in its AI model lineup, promising enhanced speed and performance over previous versions. So the new model is part of Google's strategy to encourage developers to create
Starting point is 00:03:32 innovative applications using AI Studio and Vertex AI. So, yeah, if you're going on the front end of Google Gemini, you're not going to see Gemini 2.0 Flash. Or actually, you know what? There's been so many updates over the last, you know, couple of days. Okay, actually, I'm wrong. You will see 2.0 Flash on the front end. Yeah, that's right. Finally, we get some front end updates for Google Gemini.
Starting point is 00:03:55 So Gemini 2.0 Flash is multilingual and multimodal, supporting text, imagery, and audio inputs, and can respond in any of those modes. So the model introduces, and this is on the Google AI Studio side, a Multimodal Live API, enabling real-time conversation and image analysis. So Google aims to integrate AI agents also into practical applications. Yeah, and we're going to be talking about some of these, exploring projects like Project Astra, Project Mariner, and Jules, which focus on human-agent interaction and coding assistance. So Jules, let's talk about that very quickly. That is a new AI-powered coding agent, so specifically made for coding and development tasks. And it's set to assist developers with coding tasks in Google Colab and other IDEs through
Starting point is 00:04:48 currently available, and that's only available right now to trusted testers. All right, the next one, Google's Project Mariner. All right, so this was codenamed previously, I believe it was Project Jarvis. You know, I said, that name's not going to happen because that's going to be another lawsuit, like the last company that tried to do Jarvis. So Google has officially unveiled Project Mariner, which is the first AI agent capable of navigating the web autonomously. So yeah, you can kind of do it right now with Claude's computer use, but you can't really do it on the front end. You have to do a little bit of setup, you know, download something on your desktop. So this is the first one.
Starting point is 00:05:30 It's not available to everyone yet. But all these new things that we're talking about here from Google Gemini are powered by the new Gemini 2.0 Flash. So this new Mariner is a Gemini-powered AI agent that can control a Chrome browser, moving the cursor, clicking buttons and filling out forms, mimicking human interactions on websites. So it's initially just released to a small group of these pre-selected testers. So yeah, who knows when the rest of us are actually going to see it.
Starting point is 00:05:57 But it's moving toward an AI-mediated web interaction rather than just direct user engagement. So the Mariner AI agent can perform tasks such as creating shopping carts on grocery websites, finding flights, and exploring recipes. Here's how it works. So screen activity is monitored through screenshots sent to Google Gemini in the cloud, ensuring users are aware of the agent's actions. And right now, Project Mariner operates only in the active tab in Chrome. So it does require users to kind of watch its activity or when I'm using it, I'm just going
Starting point is 00:06:29 to give it its own computer and be like, yo, go do your thing, buddy. And Google says that is intentional for transparency. There is no specific timeline for a broader rollout of this Project Mariner, but we'll see. Google, a lot of times they either ship something right away or they put out a little tease, and it could be many months like we just saw with Sora. All right, Project Astra, Google, yes, more Google AI news, y'all. I wanted to bring you other AI news, but I didn't have a choice.
Starting point is 00:06:59 So now we have a wait list. Yay. Google has opened Project Astra, which we've talked about many times on the show, to its trusted tester waitlist. So this is its new augmented reality kind of project, very similar to what we saw demoed on 60 Minutes from OpenAI with its advanced voice mode, the ability to kind of see real-time video and interact with that. So a little different than some of these other things that we're talking about. But according to Google's DeepMind page, Project Astra allows users to engage via a mobile device and prototype AR glasses, aiming to enhance user interaction through an experimental Astra app. So participants can use the app to open their camera and share their screen, gaining quick
Starting point is 00:07:45 information about objects they see, leveraging Google's apps like Maps, Lens, and Search. So the AI model behind Astra is described as multilingual, allowing users to interact in their preferred language with the AI providing vocal and written responses. So Google has also integrated a memory feature into Astra, enabling it to recall key details from previous interactions, similar to the saved info feature in Gemini. And yeah, we saw Project Astra initially released or teased at Google's I/O conference many, many months ago. So still not out, but we saw some new features, pretty impressive. All right. Last but not least, I promise this is it, y'all.
Starting point is 00:08:27 So Google has unveiled deep research. This is huge. I freaking love this. All right. I'm not going to go ahead and say it's like a Suno moment for me or a NotebookLM, but my gosh, I am loving deep research so far. So Google has just released deep research. It's its latest AI advancement.
Starting point is 00:08:48 And it promises to change how users conduct online research by automating the process. So it was just announced and it's part of the Gemini Advanced suite. So you do have to be on that $20 a month. Think of it like this. Here's what it is. And I'm going to skip all my other bullet points. You know how Perplexity can go through and look at 15 to 20 websites? I love Perplexity.
Starting point is 00:09:09 I've loved it all along. This is going to kill it. If Perplexity doesn't pivot, I'm dead serious, y'all. Love Perplexity. Use it every day. I used Google's deep research last night to help me prepare for this show. My mind was blown. Okay.
Starting point is 00:09:29 It visited 169 websites for me. Let me repeat that. It, I mean, it was a very long prompt, right? And I gave it a lot of instruction. But it visited 169 websites for me. It did all the browsing, right? So, you know, OpenAI has their, you know, ChatGPT search. Perplexity, Perplexity, fantastic.
Starting point is 00:09:52 This could be a literal, I'm not going to use the word game changing. I'm going to say the word human changing. I don't know, unless it's a quick one-off thing, if I'm going to be doing any other web browsing, I'm probably just going to be using this new deep research. It is so, so good. So make sure you go check that out. But again, you do have to have the Gemini Advanced plan, $20 a month.
Starting point is 00:10:19 I don't know why. I don't have it on my Workspace account, which I pay for, but I have it on my personal Gmail. I pay for the advanced version there as well. So it may not be rolled out if you are using Google Workspace in your organization or your business account, but you might have to pay for it on your personal Gmail, but it is there. All right.
Starting point is 00:10:39 Gosh, I told you all, hey, I warned you guys ahead of time. This was going to turn into a dual episode of OpenAI and Google News. But all right, let's get to this stuff. You actually came for the 12 new features. So what is going on? Well, I think this is also a good example of go-to-market strategy, right? Because no one was really talking about Google, right, until yesterday. And they went nutty.
Starting point is 00:11:04 But we've all been talking about the 12 Days of OpenAI. I love what, and maybe this is just the startup mentality, but OpenAI, I love what they've done with this. You know, they have a live stream every day at noon Central, 10 a.m. Pacific. It's good. It's entertaining. It's fast and it's detailed. So every single day from when it started last week until I believe the last day is
Starting point is 00:11:27 December 20th. So next Friday, they're releasing or announcing one new feature or one big update. So we are about halfway. Today marks the halfway point. So we're going to be going over the first five and I'm not just going to be rattling off like, oh, here's what's new. But I'm going to tell you what it all actually means. All right. So day one, here we go. And y'all, if you do have questions as we go along, please, please get them in. All right. So day one, we actually kind of got two announcements. They could have, if I'm being honest, they could have made this into two separate announcements, but that's fine. So on day one, we got both ChatGPT Pro, which is a new tier, the elite cool kids club or country club of ChatGPT at $200 a month. So we got a new ChatGPT Pro. We got the
Starting point is 00:12:15 full o1 model. Okay. So previously, this, uh, OpenAI's new kind of reasoning model, which is completely different than their GPT models, we just had a preview. We had the preview and the mini. So now we have mini, o1 full, and then you have o1 pro. But you only get that o1 pro if you are on the $200 a month Pro plan. So there's some other features of that Pro plan when we talk about Sora, which we're going to be getting to here in a second. So that's what was announced on day one.
Starting point is 00:12:47 So here's what these new modes and offers do, right? So the new o1 updates. So according to OpenAI, it offers a 34% reduced error rate versus previous models. It delivers deeper analysis and PhD-level reasoning for complex research, math, and coding. And it does allocate what they said, extra compute time for the o1 pro. So we don't even know is o1 pro just like, is it a completely different model? Was it trained differently? There's not a ton of information out yet. Essentially what OpenAI said is, yo, we just give it some extra juice. We give it extra compute, right? So that makes me think it's like, okay, is the o1 model being like somewhat throttled or could the o1, the full o1 model do what o1
Starting point is 00:13:31 pro is capable of? I don't think so. It does just seem like, you know, there's just extra brains in this o1 pro model. So if you haven't used o1, I think it's limited use cases, right? But if you're in data, if you do anything around data, if you do anything around business intelligence, if you do anything around software development, if you do anything around research, if you do anything in the academic fields, I think o1 is a great model. So again, I'm not going to go too deeply, but o1 is completely different than traditional large language models, what many of us use. So it does use this kind of chain of thought or reasoning.
Starting point is 00:14:07 So it thinks. It thinks like a human, right? And sometimes it will think for multiple minutes before it gives you a response. So it's not this quick, topical response that you're used to with the GPT series of models. it really thinks and it reasons and it is really good. So what's the availability? Well, now. No wait list.
Starting point is 00:14:26 It's available now. If you do want that o1 pro, you do have to be on that ChatGPT Pro plan, which is $200 a month. So what makes this unique? Well, there's a couple things here because it's technically like two or three different things, right? The ChatGPT Pro plan, well, what makes that unique is, well, it's the most expensive kind of consumer grade subscription there is out there, right? Yeah, we had Devin, you know, or Cognition's Devin that was just announced this week that we covered in our newsletter for $500 a month, but that's not for everyone. So this is, I would say, the first time that we've seen an AI or a
Starting point is 00:15:06 large language model related plan that is really premium. And I've been saying this for more than a year. I've said all along, there's going to be AI plans that are hundreds or thousands of dollars a month as we get more and more capable models, right? As we, you know, maybe officially or unofficially cross over from the AGI tier, working toward the ASI tier, right? Even as compute becomes too cheap to meter, we're still going to have plans. And I do see this eventually from, you know, maybe Claude, maybe Microsoft, maybe Google, right? But I don't think it's going to be wild to think about that there's going to be plans that cost hundreds or thousands of dollars a month, right?
Starting point is 00:15:48 Especially as these models become, you know, multi-agent environments. Think of reasoning models that have access to tools that can perform actions on your behalf, right? Who wouldn't pay hundreds or thousands of dollars a month, even if it's per user? But what makes the o1 pro model unique is, well, it just gets extra juice, right? And according to all the benchmarks, we already went over the benchmarks earlier. The benchmarks are very impressive, very impressive at math, coding. You know, think of anything that a PhD would be capable of. You know, that's what o1 pro essentially is.
Starting point is 00:16:28 It gives you highly accurate PhD level intelligence in your pocket. Wild. So how can you use it? Right. I told you I wasn't just going to rattle off a bunch of stats at you today, although we're going to be doing that, but, well, it can solve advanced scientific and mathematical problems with research grade accuracy. It can tackle intricate coding challenges, and it can execute tasks demanding careful, consistent reasoning. All right. So give it a try, right? People are like,
Starting point is 00:16:57 oh, should I do the $200 Pro plan? Well, we're going to talk about Sora. And maybe the answer to that depends on if you're going to be using Sora a lot. And I do think that we're going to see more differentiation. I said this last week once the Pro plan was announced. I'm like, there's going to be more features that OpenAI announces during this 12 days where, sorry, the paid plan is going to be limited.
Starting point is 00:17:24 The $20 a month plan, it's going to be limited for a lot of these new features. We saw that immediately with Sora. You don't get access to everything on the $20 a month plan. So the $200 a month plan, there's going to be a lot. But, you know, even on the $20 a month plan,
Starting point is 00:17:39 you can get your feet wet with o1. I still use o1 every day, even though I'm not a researcher. I'm not using it for, you know, PhD-level mathematics or coding challenges, right? I'm dumping a lot of data on it and I'm having it think for me, right? It's weird now. So often, at least when it comes to business, I always have multiple large language models do my first round of thinking for me when it comes to a deep task. Even something like planning today's podcast, right? I go to reasoning models first, right?
Starting point is 00:18:13 It's like, oh, how should I open it? How should I plan it? How should I structure it, right? Here's all my data, all my notes, right? Because I'm taking notes. Put it all in a good format for me, right? So it's not just if you're a mathematician. All right.
Starting point is 00:18:27 So day two, Sora. All right. We covered, did a little review of Sora on our YouTube channel and we shared that in our newsletter as well. But hey, live stream audience, let me know. Are you guys interested in Sora? It's weird. I've, you know, in some previous lives, I did a lot of video shooting, a lot of video editing. You know, I don't really do any of that anymore.
Starting point is 00:19:00 But it's really good. It's really good. And, you know, and I keep thinking in my head, who is Sora actually for, right? And I think very early on creatives, right? But I think eventually everyday people are going to be using it. Right. As content becomes easier to create with AI, I think consumers' expectations for highly personalized content is going to increase, right?
Starting point is 00:19:29 So I would say, let's just say for easy use, right? Fortune 500 companies. Let's say only 10% of them really leaned heavily into creative video production traditionally. Now I think it's going to be the other way around. It's going to be 90%, right? Because consumer expectations are going to increase because we're going to start getting highly personalized, very high quality video content that speaks to us.
Starting point is 00:20:00 So I think what Sora is starting to bring is going to change it. So yeah, Sora, if you don't know, is OpenAI's text-to-video model. This is one of those ones. Talk about wait list. I sometimes call Google out. You got to call OpenAI out on this one, right? And I know there's reasons. You know, they previewed this in February.
Starting point is 00:20:20 So this was a 10-month wait list. And I said in February, I said in February, this isn't going to come out until after the U.S. election. It didn't. Right. So there's certain things when it comes to safety, you know, even when it comes to following laws, right? So as an example, Miriam here is saying no Sora here in the UK.
Starting point is 00:20:39 Yeah, CEO Sam Altman said it's going to take a while, right? There's certain laws in certain countries that might prohibit things. And it's for safety reasons as well. So I get it. Super long wait. But let's talk about what Sora does. So it transforms detailed text prompts into HD videos up to a minute long if you do have that $200 a month plan, the Pro plan. It creates, extends, and refines scenes with multiple characters and precise emotions. And it maintains high visual fidelity and prompt adherence. So is it perfect? Absolutely not. Right? No. Yeah, try something that requires real world physics. Not good, right? Try to run a Sora prompt with something with gymnastics, right? Something with a human,
Starting point is 00:21:26 you know, or a group of people mountain climbing, right? The physics still aren't there. So you do have to understand what Sora's strong suits and capabilities are, whether you're going to start with text or you're going to start with an image. And you might have to run four different variations to get one that works. But for the most part, if you know what you're doing, you can create video that five years ago would have taken thousands of dollars, multiple humans. You can create something decent in a couple of hours, right, by stitching some of these short clips together.
Starting point is 00:22:01 You could create a promo video for your business. You could create an advertising campaign. You can create so many things that, again, would have taken a big budget and a lot of humans. You can do that with a $200 a month subscription now, right? I don't think you can do it with a $20 a month because your credits get burned quickly. And you do sometimes have to run multiple generations to get something usable because you get something different every time. But it can, again, I'm not going to say replace, but it can really do what thousands of dollars and multiple humans used to do five to 10 years ago. Extremely impressive. Well, the availability: it's available to all ChatGPT Plus and Pro users. In the future, tailored pricing is planned. So yeah, we might see a credit system because, you know, I saw online, even people that have
Starting point is 00:22:53 the ChatGPT, you know, Pro plan, $200 a month. It is a credit-based system. And you essentially have unlimited generations of lower quality. But if you want HD quality, you're going to run out. You could even run out pretty quickly on that $200 a month plan, right? Where I think, you know, other video generators, where they might excel is allowing you a bigger plate, right? With Runway, you can eat more, right? With Luma Labs, you can eat more. Right now, I think Sora is higher quality than some of those. People might disagree with me on
Starting point is 00:23:30 that. If you look at the ceiling, right, if you're not looking at the average, because tools like Runway, like Luma, like Kling, all of these other AI video generators, they've gotten, you know, millions of data points. Sora doesn't have that yet, but they will soon. Again, this is the worst Sora will ever be. But I think Sora has a higher ceiling. The average output right now, Sora might not be the leader because it's a couple days old, right, in terms of the general public, right? OpenAI's servers have been crashing. Like I saw, I read online, December has been OpenAI's most, or I guess their hardest hit month in terms of uptime, right? Everyone's rushing and crushing the server.
Starting point is 00:24:13 So it will hopefully get a little bit better. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience. Meet Firefly AI assistant now live in the Adobe Firefly app, the all in one creative AI studio. by Adobe's creative agent, Firefly AI assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the assistant. The assistant orchestrates multi-step workflows, drawing on 60 plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life.
Starting point is 00:25:02 You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adobe.com. Yeah, a couple people asking about VPNs, right?
Starting point is 00:25:38 So, yeah, if you're in Europe, certain countries in the EU, you don't have access. And if you use a VPN, you could try, right? But they explicitly said they are going to ban accounts that are doing that. So I wouldn't necessarily recommend it. All right. So what makes Sora unique? Well, there's a lot of features that I like that are super easy. So I'm going to talk about a couple here.
Starting point is 00:26:04 So one, style presets. Not super unique, but I think it's unique in the way that it's implemented. So you can very quickly, you don't have to be a prompt engineering expert. You don't have to be an AI video aficionado to be able to really get the most out of Sora with some of these presets. But I love the storyboard feature. I think it's not super mature yet, but I think it's going to get better. I did play around with it a little bit. But this is essentially a way that you can make a coherent story, right?
Starting point is 00:26:36 Because that's been one of the biggest challenges with AI videos so far, being able to piece together multiple of these clips, right? Because if you try to let it create something long, it may go off the rails. So generally, your best generations using AI video tools are running shorter ones that you have more control over and probably starting with an image versus text, right? So what a lot of people have done is, you know, there's now AI image tools that allow you to create consistent characters. So then they just do that and then, you know, line those up and create multiple videos. But with the storyboard feature, it does that. Right. So it's like you can literally run multiple text prompts, you know, one after another, after another, after another, and Sora does its best to make them all work together. Super impressive feature. So how can you
Starting point is 00:27:23 use it? Well, we already talked about this, but you can use Sora to quickly produce concept visuals and unique animations. You can generate educational or marketing videos with minimal effort. You can streamline content creation, saving a ton of time and resources. All right. All right. Let's keep it going. Let's keep it going. Liz said, Sora broke my heart.
Starting point is 00:27:44 They're on ChatGPT Teams and didn't get access to it, but Plus did. Yeah. Teams and Enterprise clients aren't always going to be able to get access to all of these things, right? I have multiple free accounts. I have ChatGPT Plus accounts. I have ChatGPT Teams accounts. I have Enterprise, right? Because clients hire us to help,
Starting point is 00:28:07 help them help their teams learn ChatGPT. So yeah, some of these, if you want access immediately, you might just have to be on the ChatGPT Plus plan or now on the Pro plan. All right, day three. All right, we're going to go a little faster. This one's dragging on.
Starting point is 00:28:24 Google, Google had to screw up the timing and just ship like crazy. All right. So day three was reinforcement, fine-tuning research program. Little technical, but super cool. All right, so here's what was announced. Well, it was an expansion of the RFTRP or RF.
Starting point is 00:28:41 I don't even know if we're going to abbreviate that one. So the reinforcement fine-tuning research program. It was an expansion of the program enabling reinforcement-based custom AI model development. I know this sounds dorky. It's not as dorky as it seems. So here's what it does. It employs reward-driven loops. to refine model reasoning and accuracy.
Starting point is 00:29:02 You can create specialized models in narrow domain, such as medical or legal, with minimal data. That is the key here. We're talking about dozens of examples, not hundreds of thousands, right? When you would talk about traditionally fine-tuning models, right, with a take a long time. So now, with this kind of new program, we're talking about dozens of examples. All right. So it essentially creates expert models.
Starting point is 00:29:30 models with significantly improved task performance. Right now, it is in alpha, so it's not even in beta phase yet. So, you know, I wouldn't plan on general availability for this anytime soon, although OpenAI did say that a release for this is planned for early 2025. So here's what makes it unique. It uses a grader so that you can reward desired outputs, boosting correctness and reasoning quality. And it requires, like I said, way fewer
Starting point is 00:30:00 examples. So think of, this is not, sometimes y'all, like, this show's for everyday people. Sometimes I sacrifice 1% of accuracy so I can get 99% more clarity. Think of it like this. Two years ago, if you wanted to fine-tune a model, right? So that means make this big model specific for your domain. It would cost a lot of money, sometimes millions of dollars. It would take a lot of engineers. And it would take sometimes hundreds of thousands of your own data points to essentially train the model. So think of it as you're using a car's engine, right? And then you're building, you know, the tires and the, I don't know anything about cars. I shouldn't have used that as an example, right?
Starting point is 00:30:49 But, you know, you're building all these other pieces and it takes time. Now it's not like that anymore, right? So what this could lead to is customizing your own car, but with a couple of clicks, right? And not having to have a team of people building the rest of the car, just starting with the engine. So, you know, how we can use and build these models is going to really change. And being able to get a fine, essentially a fine-tuned model with only a couple
Starting point is 00:31:19 dozen examples, right, where you have to train the model, hey, when someone asks about this, this is a right answer, this is a wrong answer, and this is why, right? You essentially can easily classify those things. It makes it simple, right? It makes it to where non-technical people and smaller businesses, startups, can start using this once it's available, whereas normally, in order to fine-tune models, you had to be an enterprise company with a bunch of, you know, AI engineers, essentially. So here's how you can use it.
Starting point is 00:31:49 You can produce high-accuracy models in focused fields. You can improve outcomes on complex, domain-specific tasks, and achieve strong performance with fewer examples. All right. I see a couple of questions, y'all. Keep them coming. I'm going to tackle any of the questions that you have at the end as I'm scrolling through here. So let's talk about day four.
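To make the grader idea from day three a little more concrete, here is a minimal Python sketch of a reward-driven scoring step. It is an illustration under stated assumptions, not OpenAI's API or its actual grading scheme: the `grade` and `average_reward` functions, the two example prompts, and the `stub_model` are all hypothetical names made up for this sketch.

```python
# Toy illustration only: the grader, examples, and stub model are hypothetical,
# and none of this is OpenAI's API.

def grade(ranked_answers, correct_answer):
    """Reward 1.0 if the correct answer is ranked first, 1/2 if second,
    1/3 if third, and so on; 0.0 if it is missing entirely."""
    target = correct_answer.strip().lower()
    for position, answer in enumerate(ranked_answers, start=1):
        if answer.strip().lower() == target:
            return 1.0 / position
    return 0.0


def average_reward(examples, model):
    """Mean reward of `model` over (prompt, correct_answer) pairs."""
    rewards = [grade(model(prompt), answer) for prompt, answer in examples]
    return sum(rewards) / len(rewards)


# A couple of made-up labeled examples (a real run would use dozens).
EXAMPLES = [
    ("Which clause covers early termination?", "Clause 7"),
    ("Which clause covers late fees?", "Clause 4"),
]


def stub_model(prompt):
    """Stand-in for a model: always returns the same ranked guesses."""
    return ["Clause 7", "Clause 4"]


if __name__ == "__main__":
    print(average_reward(EXAMPLES, stub_model))
```

In a real reinforcement setup, a score like this would be fed back to nudge the model toward higher-reward answers; the sketch only shows the scoring half of that loop.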
Starting point is 00:32:13 All right. So day four, Canvas updates. So here's what was announced. Canvas was previously only for ChatGPT Plus users. Now it's available for all. So if you are using the free version of ChatGPT, which I would not recommend. Just pay the $20 a month, y'all.
Starting point is 00:32:29 It's the easiest, like, you know, think of your coffee. I'm sipping a coffee right now. Hey, shout out Nespresso. I love Nespresso more than anything else. They should sponsor this show because I'm always sipping, sipping on their coffee in the morning. But think, if you go get a coffee at Starbucks or your local coffee shop, let's say you do it twice a week, three times a week.
Starting point is 00:32:48 Well, there's your $20 a month, ChatGPT Plus, or, yeah, ChatGPT Plus subscription plan. You know, people, if they're using the free version, they just give you enough to get a taste. But it's not enough to actually get work done if that's what you want to do. Anyways, so the big update with Canvas, well, it's available to everyone. There's some other bells and whistles. Essentially, Canvas was previously its own mode, but now it is not its own mode. So it is used in the tools section. So there's now a new icon when you are using ChatGPT.
Starting point is 00:33:21 And essentially, it is a tool that you can use inside any mode now, aside from the o1 models and advanced voice mode, but essentially it's not its own mode anymore. So it's a tool. You click on it to use it. The other big update here is it can also run Python, and it does now work with custom GPTs. All right. So as I go over what it does, hopefully that'll explain a little bit more. So Canvas serves as a side panel within ChatGPT for collaborative writing and coding projects. You're not collaborating with teams, right, like in Microsoft Pages. You're collaborating with ChatGPT. So normally when you are conversing with ChatGPT, right, you give it a prompt. It
Starting point is 00:34:01 gives you a long answer. If you want to change something, you got to kind of reprompt. But Canvas changes it. So think of it as a split pane. So on the left side, you're talking to ChatGPT; on the right side, it's like an editable document. That's a hard word to say at 7 a.m. in the morning. Editable. Editable. All right, you can edit the document live, right? And you can also work with ChatGPT live, right? So think of in a document, you can go, you can type something yourself, tag ChatGPT in without having to constantly reprompt, scroll up, you know, it's just a much better experience, right? I actually like that they called it Canvas, because that's what it is. It's a canvas and you can stay in that canvas and work on it. And
Starting point is 00:34:46 ChatGPT, right, there's all these nice little, you know, one-touch buttons, you know, to polish the writing, change the writing style, do code debugging, right? But you can stay there. So it's just much easier to navigate with this kind of dual pane, right? So you can essentially, you know, kind of quote-unquote command ChatGPT on the left pane and then on the right pane, you can see it work. You can go in there, work with it, change some things manually. It's a great mode. So it does allow direct editing of text and code, supports shortcuts and keeps version history. Also, now the big thing is you can run Python code. So this is a shot at Artifacts, right? I do think
Starting point is 00:35:22 in Anthropic's Claude, the Artifacts feature was its best, its best feature, right? That's 80% of what I use Claude for, is for Artifacts. Claude still has, and Artifacts still has way more advantages, because essentially, it can run any code. Artifacts can, or much, you know, I don't have a complete list, but you can run, you know, you can create an HTML website, you can create a business dashboard, right? You can do all these things. So right now, with the new Canvas updates, you can just run Python, right?
Starting point is 00:35:51 So you can't just render out any type of code. But Python's a great start, right? And this is something, at least if you're an engineer, software developer, coder, business intelligence, etc., you're using Python. Python is an extremely flexible programming language. You can create graphs with Python. You can create games with Python, right? You can create so many things with Python, with that programming language. But Claude Artifacts still has an advantage if you're just looking at a large language model's ability to render something
Starting point is 00:36:21 in the window. So it's available now, right? So yes, Canvas is available now, even if you are a free user, and it's available inside of GPTs. So you do have to have a paid account to create GPTs, but now you have that kind of Canvas feature or Canvas ability inside of your GPT, which I think is really nice. So what makes it unique? Well, it, like I said, provides a visual side-by-side interface for working directly within ChatGPT. ChatGPT also gains better context awareness, offering targeted inline feedback and suggestions. You know what? I actually haven't done context testing within ChatGPT, which I probably should do, right?
Starting point is 00:37:08 So I'm assuming that it's not going to eat up as many tokens if all it's doing. So let's say, you know, you ask ChatGPT to spit out a thousand words, let's just say, or a thousand tokens, right, each time. And you're just slightly reiterating. Eventually, you're going to hit that 32,000 token limit in ChatGPT on the front end, right? If you're using ChatGPT on the back end, it's 128,000 tokens. That's its memory before it starts forgetting things. So I do need to do some testing because I'm assuming that the Canvas, using the Canvas
Starting point is 00:37:39 feature is technically going to eat up a lot fewer tokens. So even if you don't think that you're going to need that direct editing feature, I don't necessarily see a downside to using Canvas most of the time for most everyday use. Also, what's great about Canvas is you can highlight specific sections for focused assistance. And also, there's those nice little shortcuts on the side, right? Where if you're working with code, it can debug, suggest improvements, you can change the writing tone with one click, you can make it longer, shorter, right? Very cool.
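As a back-of-the-envelope check on the token limits mentioned a moment ago, here is a small Python sketch of how many reprompt rounds fit in the window if every prompt and every full reply stays in the history. The 1,000-token reply and the 32,000 and 128,000 limits are the figures quoted in the episode; the 50-token prompt size and the function name `turns_until_limit` are assumptions made for illustration.

```python
# Back-of-the-envelope only: the 50-token prompt size is an assumption;
# the 1,000-token reply and the 32,000 / 128,000 limits are the figures
# quoted in the episode.

def turns_until_limit(context_limit, prompt_tokens, reply_tokens):
    """Whole prompt-plus-reply turns that fit if every turn stays in the history."""
    return context_limit // (prompt_tokens + reply_tokens)


if __name__ == "__main__":
    for limit in (32_000, 128_000):
        turns = turns_until_limit(limit, prompt_tokens=50, reply_tokens=1_000)
        print(limit, turns)
```

Under those assumptions the window fills after roughly 30 rounds at 32,000 tokens and roughly 121 rounds at 128,000. Whether Canvas edits actually consume fewer tokens than a full regenerated reply is the untested guess from the episode, and this sketch does not settle it.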
Starting point is 00:38:17 Just an easy button, you know, like huge one-click that can change the output. So how can you use it? Well, you can collaborate interactively with ChatGPT on writing or coding tasks. You can run and refine Python code right in the workspace for immediate validation. And you can request and receive targeted edit suggestions and improvements on your work without even having to talk to ChatGPT, right? Sometimes you're like, oh, okay, how do I, you know, have ChatGPT, you know, change it this way, right?
Starting point is 00:38:49 And you're thinking and you've got to type it out and you're like, ah, that's not right, right? And you're like wondering, how can I give ChatGPT feedback? Well, in the Canvas mode, which is, I don't think anyone else has that level of one-click flexibility, right? There's other large language models where you can, you know, change the tone, make it longer or shorter with one click. But ChatGPT in the Canvas mode really brings a lot of easy one-click modifications to that
Starting point is 00:39:13 toolbar. So like I said, if you're not using Canvas mode, it's pretty much my default mode, even though it's not its own mode now. It's a tool, but there's, I see very few instances where you shouldn't be using it as your default. All right. And now we have our last day, here we go. We got to the end, day five, y'all. So it's the Apple ChatGPT partnership. Is this new? No. Nothing was new here. Nothing was new except it's out. All right. So, you know, I think all these other things, there's been a certain level of surprise or intrigue. So, you know, day five, not so much.
Starting point is 00:39:49 It was just essentially the Apple and ChatGPT partnership is live, all right? As Apple updated their iOS and their Mac operating system. So here's what it does. We've covered it very, very, very in depth here on the show. So it embeds ChatGPT into Siri and also Apple's writing tools. It uses essentially ChatGPT. So there's settings that you can go into
Starting point is 00:40:17 on your iPhone and enable this. Or ChatGPT will ask you, or sorry, Siri will ask you. So if you're asking Siri something that is a little too complex, it will say essentially, do you want to use ChatGPT for this? Or you can just kind of override that and make that your default setting within Apple's settings. So it does embed ChatGPT into Siri and Apple's writing tools for advanced reasoning and content generation. It can analyze images and documents natively with no extra accounts needed. Also, this new integration that's live enhances Siri's intelligence beyond standard assistant functions. Are we finally going to have a smart AI assistant?
Starting point is 00:40:54 Well, we'll see. I think my devices, I didn't even have time to update them yet. All right. So here's the availability. It's available now. Right. So you do just have to update to the latest iOS and macOS updates.
Starting point is 00:41:07 A lot of it is device dependent, though. So, as an example, if you're using it on your Mac, you have to have an M chip, so an M1, M2, M3 chip. So it's not going to work on super old Macs. The same thing on the phone. To use kind of the camera or the vision feature, you do have to have the newest iPhone. So that's not going to work with all of them.
Starting point is 00:41:31 All right. Because, you know, for some of these things that happen, you know, on edge AI, right? It's happening on your device. Some of these things happen in the cloud. So you don't need the most powerful device. But some of these things you do need the most powerful processors. Yeah, Monica here is saying, finally. Yeah, I agree. I'm excited to test out that new feature and to report more on it. Because to tell you the truth, I haven't been able to research
Starting point is 00:41:56 that one as much. That is the one, the last announcement that just came out. So what makes this unique? Well, one thing that Apple does a really good job on is privacy, right? Yes, they are late to the game. I think most people, you know, by the time they hear Apple Intelligence, at least if you're like me, I kind of roll my eyes. You know, I don't know. To be honest, I kind of hate the hard stance on branding that Apple is taking toward artificial intelligence by calling it Apple Intelligence. I think that's a little cheesy, especially coming from the company that is multiple
Starting point is 00:42:34 years behind its biggest competitor in Microsoft, right? So to try to rebrand artificial intelligence as Apple Intelligence when most people still aren't going to get these full features until 2025. And we've already had Microsoft Copilot for multiple years now. Kind of cheesy, Apple. I don't know. All right. So here's how you can use this new ChatGPT Apple Intelligence integration. So you can draft documents, emails, and analysis without leaving Apple's ecosystem.
Starting point is 00:43:05 You can summarize PDFs and interpret images right on your device, and you can enjoy faster, more integrated AI assistance for everyday tasks. I'm not going to use it. If I'm being honest, I'm not going to use Apple Intelligence, at least 90% of it. I mean, we'll see if I end up using Siri. I just like using advanced voice mode on ChatGPT. Also, Google Live, their voice mode, is probably the leader right now because it is the only voice mode that is neural voice, right?
Starting point is 00:43:35 I'll say that because obviously Apple and Siri or, sorry, Siri and Alexa are connected to the internet, but they're dumb assistants, right? So the only true, like, AI assistant that is connected in real time to the internet right now is Google Gemini's Live. But hope, and now Siri technically, right? So I haven't tried it out yet, but hopefully we'll see some updates from the advanced voice mode. I love using it, but the downside of ChatGPT's advanced voice mode is it's not connected to the internet.
Starting point is 00:44:02 So you're not getting the most up-to-date real-time information. All right. So that's how you can use this new Apple, kind of, thing here. So let me just quickly, I saw a couple of questions. Sorry if I missed it. I'm actually on my laptop today. So normally, normally I have, you know, two computers and I can see a little bit more. So I'm going to try to get to a couple of these questions here. And then I'm going to tell you what I think is next for OpenAI's kind of 12 days. So Frank is asking, do I have to pay per user for Pro if we are in the same organization? Frank, I believe yes, although I don't believe, I don't believe as of yesterday,
Starting point is 00:44:43 at least, I was talking with one of my contacts at OpenAI. I don't believe that Pro is yet available for Teams or Enterprise accounts. I'm sure it will be any day now, but as far as I know, right now, right, and I have a Teams account. I have an Enterprise account, but I don't have billing or admin access to that. But on my Teams account, I can't upgrade to Pro. I can only do that with my kind of quote-unquote normal accounts. So, you know, a free account, you can upgrade to Pro; a ChatGPT Plus, $20 a month, you can upgrade to that $200 account. Frank also asking, where did o1 go? I had it seven days ago. For us, it's still there. So yeah, if you have a paid $20-a-month account, o1 is still there. Philip, joining from YouTube, says
Starting point is 00:45:33 Gemini Pro and Ultra. So can we say Gemini is the biggest competitor against OpenAI? Yeah, they are. I'm going to save this rant for another day, Philip. I think, I think, wow. I mean, if I'm being honest, I think until yesterday, OpenAI was in its own tier, right? It was tier 1, OpenAI; tier 2A, you know, 2B, Microsoft, even though Microsoft's just essentially,
Starting point is 00:46:07 no, I got to change that. I would actually say previously, I would say 1A, OpenAI, 1B, Microsoft Copilot. Then I would say 2A was probably Google Gemini, and then 2B was probably Claude, Anthropic. But now I think Google is back up in that tier one with what we just saw yesterday.
Starting point is 00:46:27 I mean, my gosh. I mean, the problem, they have a marketing problem. Google has a go-to-market problem. OpenAI does not, right? OpenAI brings, again, yeah, sometimes you get these long wait lists, but at least the products get released. Google, not so much. And sometimes they're just releasing it on the developer end, right? Which people don't understand.
Starting point is 00:46:49 People do not understand this. Five years ago, it was CTOs, CISOs. It was the highly technical people that were making these decisions. So I think Google especially has this all wrong. They have their go-to-market all wrong. They're only appealing, I think, or relying too much on developers. Because aside from, you know,
Starting point is 00:47:11 finally, the front end of Google Gemini just got some much-needed updates in the last 24 hours with the new 2.0 Flash and also the new 1.5 deep research. Amazing. But for the most part, the front end of Google Gemini has just been abandoned, right?
Starting point is 00:47:31 But that's what people use. I literally, literally, and this is not an exaggeration. People reach out to me all the time, and they're saying, Jordan, listened to your show, blah, blah, blah, you went off on some crazy rant, but our entire executive team looked it up and you were right, right? But these executives that are trying to make decisions on AI, they're going on the front end. They're going chatgpt.com, gemini.com, claude.com, dot AI, Copilot, right? That's what they're doing.
Starting point is 00:47:56 They're not going to Google's AI Studio or Vertex anymore because now the decision-making ability, it is CEOs. It's CMOs, right? It's people that are not necessarily technical. They're not the ones necessarily going on the developer platform to test things out. I think Google has tremendously missed their go-to-market. But yeah, right now they're the biggest competitor. Sorry, Philip, I went on a rant. All right. Jay, how long did it take to give you a response after it searched the 160 sites? Great, Jay. So yeah, the Google deep research searched 169 sites. I literally, like, I made a face.
Starting point is 00:48:40 If you're on the live stream, I was like this. I was up here by myself, I don't know, 10, 11 p.m. last night, doing some last minute prep for today's show. And I was legit flabbergasted. It took probably, I should have timed it. I'll probably do a video review. It probably took about two to three minutes. I'm guessing it might take longer as more people use it, right?
Starting point is 00:49:02 It might be kind of like, you know, bandwidth, right? So if you're using it during busy times, it might take longer. But it took about two or three minutes. But you know what? I'm fine with that because as I've been using Perplexity, I've been using Perplexity since it came out. I personally feel, and in my team's experience, Perplexity is getting more things wrong than it used to.
Starting point is 00:49:22 Its new kind of quote-unquote reasoning, not that good. I don't think, this new deep research, I was blown away. And yeah, Jay, I'm pretty much ready. I'm pretty much ready to drop Perplexity, not just yet, right? There's a lot of things that I had, you know, baked into our workflow, so I'm not yet ready to drop Perplexity because of this new deep research. I have to do a lot more testing. But, I mean, instantly, 80% of my use cases for Perplexity went out the window. The very first time that I used deep research from Google. Amazing. And that's not really happened, right? Very rarely do I use a tool once, right? Because I test,
Starting point is 00:50:07 I stress test these things, try to break them, you know, reverse engineer what's good and bad, right? Very, very, hardly ever, do I use something once and say, this is going to replace a big part of my team's workflow? That happened. That happened. All right, y'all. I don't see any more questions. I hope this was helpful. I hope it was helpful to get a nice overview of everything that's been going on. But here's what I think is next. Okay. There's still seven days. So what is OpenAI going to release? Well, I think we're going to see projects. And some of these, you know, there's some random rumors on the internet and screenshots. And some of these things are just my hunch from covering this every single day. I think we're going to see projects, which is essentially a folder system inside ChatGPT,
Starting point is 00:50:58 to help you stay better organized, but also to upload files. So something between like Spaces in Perplexity and Projects in Claude, but essentially one spot where you can upload custom instructions, different chats, and different files. I think we're going to see that. I think we're going to see some advanced voice mode updates, hopefully the ability to use tools and the internet within advanced voice mode, right? That's what I want.
Starting point is 00:51:22 I want to be able to have this neural voice that I can talk to, like a Siri, like an Alexa, right? That's actually smart, that is powered by a large language model, but I can upload my data to. And it can also look up real-time information on the internet because right now, advanced voice mode, you cannot upload your own data. It doesn't have access to any of the other tools of ChatGPT.
Starting point is 00:51:43 So I think, and I hope that we're going to see something like that. I think we are going to see a GPT-4.5, maybe not released, but a wait list. I'm not sure. But I think we're at least going to see the next GPT model, since we've seen a lot of movement in the o1 series. And I think we're going to get a preview of the rumored Operator agent from OpenAI, especially, right, with what we just saw from Google.
Starting point is 00:52:10 We've been seeing reports for many months that OpenAI has been working on this new agentic system. And that's next on their step, right? They have this five-step plan to AGI, right? And we're at reasoners. And so it's only, it's only logical that the next step toward that is agents, right? We've seen it from Microsoft Copilot.
Starting point is 00:52:29 We've seen it from Salesforce, right? We've seen it now from Google, right? Even though it's probably going to take many, many months for most of us to get our hands on this. But I do think that's next. So that's what I think. Let me know. Actually, right now, if you're still listening on the live stream, get your best guess in for
Starting point is 00:52:45 what's next. I'm going to grab some of my favorites and put them in today's newsletter. All right. So let me know what you think is next. All right. And also, if you haven't already, please go to YourEverydayAI.com. Sign up for the free daily newsletter. If this was helpful, please repost this. I know this was a longer show. It ended up being half Google, half OpenAI, but hey, in 50 minutes or in 25
Starting point is 00:53:06 minutes, if you listen to me on 2X, which I definitely would, you just got tons of hours' worth of updates. And now you are the smartest person in AI at your company. All right. So thank you for tuning in. Again, please go to YourEverydayAI.com. Sign up for the free daily newsletter. Yeah, we recap today's episode and every episode. So when we have super smart guests, we recap the main insights in our newsletter, as well as keeping you up to date on everything else that's happening in the world of AI. Thank you for tuning in. We hope to see you back tomorrow.
Starting point is 00:53:38 And every day for more, Everyday AI. Thanks all.
Starting point is 00:54:08 And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit YourEverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.
