Everyday AI Podcast – An AI and ChatGPT Podcast - EP 412: AI News That Matters - December 2nd, 2024

Episode Date: December 2, 2024

SORA Leaked and now OpenAI squashed it. Will it ever come?Elon is picking more fights with OpenAI -- can he win?Claude is shipping like ships that need to ship, but does it matter?↳ So many AI quest...ions. ↳ We'll have the AI answers. Here's this week's AI news that matters. Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Ask Jordan questions on AIUpcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:1. Controversy Over OpenAI Sora2. Anthropic's Innovations3. Alibaba's New Experimental Model4. Uber's AI Expansion5. NVIDIA's Fugato Model6. AI Video Advancements7. OpenAI’s Revenue Strategies8. Lawsuit Against OpenAI and Microsoft by Elon MuskTimestamps:00:00 OpenAI's Sora: Controversy over artist involvement.03:23 Artists criticize OpenAI's exploitative, controlling practices.06:39 Leak increased interest in Sora, revealing features.11:17 Anthropic's open-source AI enhances internal data connectivity.15:36 Claude struggles with steering tone and maintaining structure.17:33 Reasoning models offer thought process and fact-checking.23:26 NVIDIA unveils Fugato, pioneering generative audio model.25:04 Analyzing audio clips for frequency and reverb.31:09 Horizontal video transformed into vertical with AI.31:57 Expanding video perspectives using AI-generated content.35:34 11 Labs' GenFM: Multi-speaker podcasts with AI voices.40:09 Musk lawsuit claims OpenAI's profit harms competition.44:36 Multiple AI models released; robotics raise concerns.Keywords:Everyday AI, Jordan Wilson, OpenAI, Sora leak, PR puppets, unpaid labor, YouTube content, Sora models, text-to-video tool, Anthropic, Model Context Protocol, Claude AI, Alibaba, QWQ 32B Preview, Uber Technologies, Scaled Solutions, gig economy, AI development outsourcing, labor market, NVIDIA Fugato, audio synthesis, AI Video Advancements, Luma Labs Dream Machine, LTX Video, AI autonomy, 11 Labs GenFM, Elon Musk lawsuit, Anti-Competitive Behavior, XAI, ChatGPT advertisingSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live and Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. Did chat GPT release anything for its second birthday?
Starting point is 00:00:49 Why are Elon and open AI fighting again? Are we all going to be working AI jobs at Uber? And is chat GPT going to start shoving ads down our faces? So many AI questions this week, we've got your AI answers. What's going on, y'all? My name is Jordan Wilson and welcome to Everyday AI. If you're maybe wondering some of those questions this week as you're looking to grow your company and grow your career, don't worry. We're going to be here with some answers.
Starting point is 00:01:23 If you're new here, thank you for joining us. We do this every single day. Everyday AI is for you. This is your daily live stream podcast and free daily newsletter, helping us all learn and leverage generative AI to grow our companies and our careers. So if you're listening on the podcast, thank you as always. Make sure to check out your show notes. There's a ton in there. But most importantly, maybe is a link to our website at your everyday AI.com.
Starting point is 00:01:51 Make sure you go sign up for our free daily newsletter. So yeah, on Mondays, we bring you the AI News That Matters. So you don't have to waste hours every single day wondering how AI is going to impact your career or your company. On Mondays, you can spend time with us. But we do this Monday through Friday every single day. bringing on some of the best guests in the world, and you can ask them questions and get answers live, right?
Starting point is 00:02:14 And you can also go to our website at Your EverydayAI.com. We have more than 400 episodes. You can go listen to old podcast episodes on our site. You can watch videos on our site. You can read. There's thousands of free generative AI content. So make sure you go to Your EverydayAI.com. All right, enough chit-chat.
Starting point is 00:02:34 Let's get into the AI News that matters for this week. and thank you, as always, to our live stream audience for joining us. Michael and Sam and Philip joining on YouTube and Brian and Jay saying that he's in everyday AI withdrawal from the Thanksgiving holidays. So yeah, excited to be back with you guys all in Rolando and Marie and Woozy and Fred and Christopher, everyone else. All right, let's get into it, y'all. So Open AI's Sora.
Starting point is 00:03:03 We haven't heard about that in a while. What's going on? Well, Open AI's new text to video AI tool, SORA, is kind of embroiled in a little bit of a controversy following a leak by early access artists, highlighting tensions between tech companies and creative communities. So the artists posted in a hugging face space as PR puppet SORA. So these artists were granted free access for testing and they accused Open AI of using them as quote unquote, unquote, PR puppets. That is why the name of the hugging face space there was called PR puppet SORA. So they
Starting point is 00:03:43 accused OpenAI of using them as puppets instead of genuine collaborators, revealing a rift over the company's approach to artists involvement. So yeah, essentially late last year, or no, early this year. So almost 10 months ago, a group of select creatives got access to Open AI SORA, there was some processes, right? You had to sign, you had to sign some documents. You had to say, hey, we're going to get these things approved through Open AI before we release them. You know, Open AI has said that was kind of not necessarily control the quality, but to make sure harmful content was not getting out through its new SORA tool. But these new artists, these artists that just posted this online are not liking the terms that they signed up for. So according to their
Starting point is 00:04:33 statement on hugging face, around 20 artists claim they were misled into unpaid labor, arguing open AI exploited their creative input for marketing despite the company's $150 billion valuation. So the group criticized restrictions requiring OpenAI's approval for SORA generated content publication, suggesting the process was more about promoting the tool than fostering creativity. So in protest, these artists leaked SORA access. So, through a back end API. And they posted all of this, essentially a Python script on Hugging Face, allowing people to use SORA.
Starting point is 00:05:13 It was only up for a couple of hours. I went in and tried it out. So it did really put some pressure on OpenAI, and Open AI quickly shut this down. So they responded by suspending Sora's early access three hours post leak, defending its voluntary program and emphasizing the valuable contribution. of hundreds of artists. So not all participants, though, agreed with the dissenters, as a lot of artists went online and supported Open AI, stating that this protest from a select few did not reflect
Starting point is 00:05:47 the majority view. So the incident adds to existing scrutiny over SORA's training practices with concerns about potential use of YouTube content raising ethical and legal questions. So we did cover this in our newsletter last week, y'all. So I'm curious, though, what does our live stream audience think? I'm going to give you my takes on this. So did these artists make a point, right? So they essentially said, hey, the whole world has wanted this SORA tool. We're going to create a kind of a technical back door and allow people to use SORA.
Starting point is 00:06:22 Well, number one, it took forever. It took, if you were very early, you could get it out in a couple minutes. but for most part, you know, kind of in the middle of this three hour wave, it took like an hour. So could people really use it? Not really. And did these artists make a point? I don't know, right? You're signing up for something, right?
Starting point is 00:06:43 When you get early access to a program, you are essentially signing away your ability to make money, right? And if a company, any AI company says, hey, we're going to give you early access to these tools. here's the rules. Do you want in? Yes or no? I mean, that's what you're signing up for. So I'm not quite sure of what this group of kind of rogue artists was hoping to accomplish. But I think, if anything, they just brought more eyeballs to SORA, right? We hadn't been talking about it a lot recently. And anyone that follows the show, I've been saying all along that we're not going to get access to SORA until after the election. So, you know, I would assume in the coming months that we're going to be getting access. But a couple of things actually can.
Starting point is 00:07:26 came out from this leak, though. So yes, it just kind of was through an API backend. But we saw a couple styles, right? So that's interesting. So what that leads me to believe is that we are going to either have the ability to create images with SORA or, you know, we'll see a Dolly update that essentially uses the SORA engine. Also, it was called Turbo. That part is huge. So that indicates that OpenAI actually had different tiers or different models of SORA. So potentially a turbo version, which is the version that through this hugging face repo that people kind of got access to be at the back end, it was a turbo version. So it was supposed to be faster.
Starting point is 00:08:14 So a lot of people are saying, oh, this is the more powerful version and I'm not necessarily sure about that. I think that whenever Open AI does grant access to the rest of the world to SORA, I don't think we're going to be getting the most powerful version. If you think back to the turbo, how Open AI has used this in the past, normally it's actually a dummy down version. It's not the most powerful version, and it's just faster, right? So I don't necessarily think that when Open AI does actually release this, that we're going to get the most powerful version. There's been early reports that we've talked about on the show that they're actually working on a new version of Sora.
Starting point is 00:08:49 So I don't think what it does come out, that we're going to be getting the most powerful version. we'll probably be getting this turbo version or a version that sacrifices quality for speed. So I don't know if this worked. I think of anything, like I said, I think it just brought more eyeballs to SORA. And some of the, like some of the generations were pretty impressive. Some weren't. But the ones that worked, right, the thing with these AI video models, just like anything with generative AI, they're a role of the dice.
Starting point is 00:09:17 But the dice, when you're talking about video, have way more sides to the die. right so uh you know i think that the floor is much lower the ceiling is much higher and for those generations that hit they looked very impressive better than literally anything else out there so i think if nothing else you know this just gave soar up more eyeballs people talking about how good it was could this have been an internal leak a lot of people are speculating that oh open a i did this i don't know uh that doesn't really seem uh like a like a you know marketing ploy out of their wheelhouse i could be So, yeah, Fred is saying artists think differently. All right.
Starting point is 00:09:59 So we'll see. We'll see. All right, let's get to our piece, our next pieces of AI news, y'all. So, Anthropic just been shipping. They have been shipping so much the last couple of weeks when a lot of other big companies, you know, put out previews or they tease or weightless. Anthropic, the maker of Claude is shipping. So Anthropic has launched a new open source standard called the Model Context Protocol, or MCP, to improve AI models access to data, potentially leading to more relevant and accurate responses.
Starting point is 00:10:34 So MCP allows AI models to connect with various data sources, including business tools and content repositories overcoming the limitations of isolated data systems. So according to Anthropic, MCP enables two-way connections between data sources and AI applications, simplifying the development process by offering a standard protocol instead of separate connectors. So Anthropic also has released pre-built MCP servers for platforms like Google Drive, Slack, and GitHub, with plans to offer other toolkits for broader organizational use. So right now, this initiative aims to create a substantial, or sorry, a sustainable architecture for AI systems, maintaining context across different tools and data sets.
Starting point is 00:11:23 So despite its potential, the adoption of MCP faces challenges, particularly from competitors like OpenAI, which recently introduced a similar feature in its chat GPT platform called Work With Apps. However, Work With Apps is a one-way street, and MCP from Anthropic is a two-way street. Work with Apps is very simple to set up MCP not as simple. So you do have to be a little more technical to be able to work with your kind of internal databases here with MCP. But it's an open, they open source this, right?
Starting point is 00:11:56 So I do like that piece because, you know, I think the open source community can take this MCP protocol and really run with it and really create some great ways to connect large language models with internal data sources in the very near future. So I do like Anthropics approach here, even though you do have to be very technical, right? Just like as an example, we did this live on the show about a month ago, showing off Anthropics new computer use where you can essentially talk to Claude and it can literally run a computer like an agent. It's not very good yet, but I do like Anthropics approach here. They're not teasing things. You know, they're just shipping new helpful updates. Speaking of new updates, I don't know how helpful it is, we'll see. But Anthropic has also introduced customizable writing styles for Claude A.
Starting point is 00:12:46 So Anthropic is enhancing its clawed AI chatbot with new customizable writing styles, a move that could transform AI communication by allowing users to tailor responses to their preferences. So maybe we'll have less delving and less revolutionized or less in today's digital realm content out there, right? So this development is significant as it aims to make AI interactions more natural, less robotic and more personalized. So there are three preset styles available for everyone. So right now, this is available also for free users, although it's much more limited. So the three preset styles available are formal for professional and precise responses, concise for short and direct responses, and explanatory for detailed explanation. But the standout feature is the custom style option, which allows users to upload writing samples
Starting point is 00:13:40 and descriptions to teach Claude their preferred communication style. And then you can edit those. So over time, users can refine these custom styles, making the AI a better mimic of their personal writing, tone, and voice. So the introduction of these features raises questions about the role of AI in content creation and the potential blurring of lines between human and AI generated content. So, is it any good? Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite. into one conversational experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the All In One Creative
Starting point is 00:14:27 AI Studio. Powered by Adobe's Creative Agent, Firefly AI Assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the Assistant. The Assistant orchestrates multi-step workflows drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrated Premier, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes
Starting point is 00:15:07 is visible so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at Firefly, Adobe.com. Meh. So, I mean, I'll say this. And let me explain this, right? Because I know some people are going to get angry, but I like to tell people how it is. I use large language models like Claude, like chat GPT, like Gemini, and then other
Starting point is 00:15:43 platforms that leverage large language models, right? Like perplexity. I use these hours, hours a day, right? It depends on the week. but anywhere from two to 10 hours, right? I'm using large language models since the date, since not just since they were released, since they, before they were released, right?
Starting point is 00:16:01 Since late 2020 is when our team started using, you know, the early GPT technology. And my background is also as a journalist. So the two things I know most about right now in my life are writing and large language models, right? Out of the box, Claude is great. It is. It sounds less.
Starting point is 00:16:21 less robotic than other large language models. I will say Claude is one A, Google is one B, and then chat GPT is number two, in terms of out of the box kind of content writing or the ability to sound human, right? But even with this new style preset here from open, or sorry, from Anthropic, it's not very steerable, right?
Starting point is 00:16:46 It's kind of hard-headed, if I'm being honest. So if your writing style is, is kind of vanilla, and there's nothing wrong with that, then I think you'll like this. If you have a unique writing style, I don't think you're gonna like this, right? So as an example, you know, I did try this out. I was gonna put it, you know, do a review.
Starting point is 00:17:03 I decided not to because I didn't think it was that great if I'm being honest. One of the problems, right, how people write, it's a style, it's a cadence. And I think in my many, many, many, many, hundreds of hours of experience, ChadGVT is still better at being steerable. And what that means being able to improve and steer a tone of voice where Claude, like I said, is better out of the box.
Starting point is 00:17:29 But even with this new style preference, I don't think it's very steerable. What you kind of get in the first iteration, even though you can go conversationally edit it, which I think is nice, it's not that great. And what people don't see on the back end, right? So let's say you generally write in short sentences and then, you know, you have multiple paragraph breaks, et cetera. It's just plain text, right? So you can do all that in your writing, but ultimately what Claude is seeing is plain text and is jumbling it all in giant paragraphs, right? Which is not saying like a telltale sign of AI, right? But it just makes it hard to read.
Starting point is 00:18:02 So hey, live stream audience, let me know. Are you liking this new Claude style? We did put it in our newsletter, obviously. So let me know what you think. Hey, Justin, thanks for joining us. Justin said, long time listener. I've been listening to you for a long time, first time seeing you live. If you listen to the podcast, come join us live, right?
Starting point is 00:18:22 It's fun. You can network with other people. Ask me questions if you have any. I'll try to do my best. All right. More AI news. So Alibaba, their Kwan team has released a new experimental AI model named QWQ32B preview. Love all these model names that are super easy to say.
Starting point is 00:18:45 So this new model from Alibaba's Kuen team. is a reasoning model. All right. So similar to open AIs O1. So this new model is notable for its 32.5 billion parameters allowing it to handle prompts up to 32,000 words in length. So it is what they're hoping, what the Quinn team is hoping to be a rival to open AIs O1 model, which was originally co-named Strawberry. Now we all know it is called O1 and it's designed to improve problem solving skills. So yeah, the big. biggest difference between traditional large language models and reasoning models is essentially these reasoning models go through kind of a chain of thought process under the hood,
Starting point is 00:19:29 doing kind of the initial prompting work that a human would normally do to drive better results. So these new reasoning models, right, it is kind of like a new tier. So you can't really compare, you know, quote unquote traditional large language models with reasoning models because they're very different in their functioning. So the new QWQ32 preview is one of the few AI models that facts checks itself, according to Quinn, reducing common errors, but also sometime taking longer to find solutions. So right now, users can run and download QWQ from the AI development platform, hugging face. And despite its advancements, the model has limitations, including potential language mixing and circular reasoning patterns,
Starting point is 00:20:12 as noted by the company itself on its blog. So the model from Quinn comes just about a week after R1 Light, which is the newest model from China-based startup DeepSeek. So yeah, in the last like 10 days, we've gotten two reasoning models out of China. First, R1 Light from DeepSeek. And now, QWQ32B preview from Alibaba's Quen team. Mouthful. Are we all going to be working for Uber? I don't know.
Starting point is 00:20:46 Maybe. All right. Our next piece of AI news. Uber Technologies is expanding beyond its ride-sharing origins by entering the AI development outsourcing market with a new division called Scaled Solutions. Interesting name choice there. So the move leverages Uber's gig economy expertise to provide AI model training and data labeling services, potentially reshaping its business model in opening new growth avenues. So Uber's entry is timely as global demand for human vetted data to train AI models is increasing, highlighted by the success
Starting point is 00:21:25 of companies like scale AI, right? So that's scale, that's this kind of scale AI's big thing, which is why I thought it was kind of, I'm guessing either strategic or ironic or maybe a little bit of both, that Uber decided to name this new division scaled solutions when the, one of the leaders in this space is scale AI. So Uber is recruiting contractors with programming skills, language proficiency, and cultural knowledge in countries such as India, the U.S., Canada, Poland, and Nicaragua, among others. So contractors will perform tasks like image, labeling, and text annotation with earnings based on completed tasks, aligning with Uber's existing payment model. This is weird, y'all. Let's be honest.
Starting point is 00:22:15 but this is, I don't want to get into this. I don't want to get into this too deeply, but I'm on the record. My very first, my very first podcast, like two years ago, was AI is going to take way more jobs than it creates. And I do think, and I've said this before, many times, traditional nine to five jobs are going to be gone in a couple of years, right? Or at least look very, very differently or a much fewer percentage of people having nine to five jobs.
Starting point is 00:22:41 So I don't want to get into the whole, like, AI is going to, take all our jobs and universal basic income. That's not what this is about. But I think a lot of us in a couple of years are going to be having similar positions like this where you are working in your area of expertise to help big tech companies gain, you know, domain specific and niche knowledge. That's, that's the future. I think we're all going to be having, you know, maybe a handful of these kind of side gig freelance jobs in our area of expertise, maybe in addition or maybe in, of traditional 9 to 5. So I don't know. What do you all think? It's kind of weird to think about, right? And people thought I was crazy when I said this until Reed Hoffman, one of the co-founders
Starting point is 00:23:25 of LinkedIn said the same thing. And then everyone's like, oh yeah, we should we should take a look of this and, you know, relook this, you know, nine to five culture and, you know, what full-time employment even means and the future of it, et cetera, right? When I said it, I was just weird. But when, you know, some of the smartest people in the world say it, they're like, oh, okay, maybe we should think about this. But I mean, this Uber, it's not surprising. It's not surprising, if I'm being honest. I think we're going to see a lot of other big tech companies shift to something like this because the higher quality human data that we have to start with, right, because one thing that we've seen is unfortunately we're seeing this
Starting point is 00:24:02 cycle of AI regurgitation, right? So so much of the quote unquote new content going into large language models training sets is created from AI. So, you know, we're getting unfortunately only this ugly cyclical data regurgitation thing where, you know, that it's lower quality data going into, or I should say lower quality new data that's going into the next version of model. So there is an increased emphasis on human created data, right? So turning all of our kind of knowledge into data, but having humans do it, you know, with working alongside hand in hand large language models. So it should be interesting to see how this how this one shakes out. So yeah, Fred is saying sounds like a security risk, maybe. One A Lee from YouTube says,
Starting point is 00:25:01 to be fair, you're not weird because you said the comments about jobs. That's good. I'm just weird for other reasons, right? Who else talks about AI every single day? It's extremely weird. All right. Our next piece of AI news, I'm excited about this one. So, in video has unveiled Fugato, a groundbreaking generative model poised, I think, to transform audio synthesis by blending music, voices, and sounds in unprecedented ways. So this new model employs cutting edge synthetic training methods and inference level combination techniques, allowing it to create unique sounds that have literally never been heard before by human ears, right, such as saxophones barking or voices, you know, singing classical music underwater.
Starting point is 00:25:51 So Fugato is not yet publicly available, and NVIDIA did not really release a lot of details about how, when, or if they would make the model publicly available, but a sample-filled website, which we linked to last week in our newsletter, and we'll link to again, demonstrates its vast range of capabilities. So according to NVIDIA researchers, crafting a meaningful training data set for Fugato posed challenges as it required revealing complex relationships between audio and language. So the team utilized a large language model to generate Python scripts that create instructions for different audio personas, enhancing the model's ability to synthesize and modify sound traits. So existing audio data sets, lacking inherent trait measurements,
Starting point is 00:26:35 were enhanced by using audio understanding models to create quote unquote synthetic captions, providing natural languages, language descriptions that quantify traits like gender, emotion, and speech quality. So audio processing tools for further analyze clips of an acoustic level measuring aspects like fundamental frequency variance and reverb. All right. So I'm going to try this here. We haven't done this in a while.
Starting point is 00:27:03 But let's go ahead and take a quick listen. So live stream audience, if you could, let me know if you're hearing this. Sometimes technology doesn't want me to share audio, but let's go ahead and take a listen here. Live stream audience, let me know if you can hear this. But we're going to listen to about 15 to 20 seconds from this NVIDIA kind of output here. Let's go ahead and take a listen. Or direct immersive and shifting soundscapes for film or audio productions. All right.
Starting point is 00:27:37 So this is the video in video released. And here is the prompt that went or the input that went into Fugato and then you can hear the output. So they don't read this. So it says create a sound where a train passes by and becomes a lush string orchestra. So you're gonna hear that. And then the next two that they describe,
Starting point is 00:27:56 you should be able to hear them. Here we go. That's very impressive. All right, let's listen to one or two more here quick. Instructing Fugato to extract audio elements from a sound clip such as isolating a voice track in a piece of music is just as easy. That is difficult to do.
Starting point is 00:28:45 So Marie is saying, this is great. The train whistle turning into an orchestra. So let's listen to one more, but that was combining multiple prompts. So it was combining or essentially multiple inputs. It was taking a flat music track, right, that had vocals and audio. And it separated them, which anyone that has worked in audio or video editing knows that is very, very hard. and very time consuming. And up until generative AI has kind of been impossible.
Starting point is 00:29:16 I think we're going to listen to one more here. Fugato also allows you to generate new speech samples. Kids are talking by the door. So this input said, in a calm voice with an American accent, say, kids are talking by the door. All right. So let's listen to it. And then they're going to say how they modify it. Kids are talking by the door.
Starting point is 00:29:35 And if you want a different delivery, Fugato can do that too. Then it says, turn this. calm voice into an angry voice. So again, using your output as your input and then modifying it with text. Ready? Kids are talking by the door. Super impressive. We haven't really seen anything like this yet.
Starting point is 00:29:58 So Jessica asking, will this work to clean up background noise on an old cassette tape? I think Jessica, in theory, if Nvidia releases it, that's what it can do with natural language, right? And yes, I do have quite a few contacts at NVIDIA. So there's a couple new NVIDIA kind of concepts or models that I'm going to try to get people on the show here. Suzanne on YouTube says, wow. Michael says, yes, there is the one that they changed the voice tone that we just heard. All right, let's get back to the AI news. And we're going to look at one more demo.
Starting point is 00:30:32 So AI video models went crazy this week. So Luma Labs released their Dream Machine 1.5 update. so it enhanced realism and motion consistency. So this update also includes a mobile app. Users can also in Luma Labs now generate consistent characters. That's a big one. And it supports conversational interactions. So not just having to completely reprompt.
Starting point is 00:30:58 So that's from Luma Labs, Dream Machine 1.5. Then we got updates from LTX video. So they released their tool as an open source option available on. hugging face spaces, allowing users to run their model locally. LTX video. Yeah, you will need a little bit of a power, though. I think optimal performance, they said, if you want to run this LTX video locally, which the fact that you can create AI, text to video models locally on a machine is wild.
Starting point is 00:31:32 Considering, like, a year ago, you know, even what we have now, which we would consider terrible a year ago was revolutionary. Revolutionary a year ago. So you can actually create pretty decent AI text-to-video videos. But you do need at least an InVIDIA-R-TX 4090 GPU, and I think that's like $2,000 for the chip. So yeah, you can run it locally, but you got to have a pretty powerful chip GPU there. And then last but not least, we have an update from runway.
Starting point is 00:32:07 So a couple things here. So one is their new expand video. So that's a feature that allows users to reframe existing videos into different formats, such as as an example going from a 9 to 16 or kind of a vertical video to a 16 by 9 and then filling it in and then doing it all over again. They also released an AI image generator called frames, right? So they're going about it the backwards way, kind of, right? Starting with AI video. And now they released AI.
Starting point is 00:32:37 frames. So frames, it offers high quality photo realism and advanced stylistic control, enabling the creation of consistent aesthetics across different worlds. All right. So let's just quickly now take a look here for our live stream audience. At least, you can see this new, this new frames feature. So let's go ahead and take a look. And I will try to narrate it for our podcast audience. So it's starting here with a 16 by 9 video, right? And then with a prompt, you can turn that into a vertical video. So it's creating, so it's this woman, you know, outside kind of looking, it's a close up of a woman, presumably at a park or a waterfall, but you don't know. So then it adds all this new vertical information and you see, oh, yes,
Starting point is 00:33:23 it is the top of a waterfall, but that is all AI generated. So what started as a close up kind of, kind of horizontal video turned into a vertical video. So now you have all this new AI generated information above this woman's head showing a waterfall. Then you can take that as an example and expand it even more. So you can just keep expanding as an example going from vertical to horizontal and you add all this new things. And then you can turn that large one into verticals. And then we'll see it turns it back, you know, back and forth.
Starting point is 00:33:55 So the same thing here, you have this, the second example of a man taking photos. It looks like in a cave, a vertical video, right? turns it horizontal. So now you see the beginning of what looks like a little lake inside of a cave or something like that. And then you see this new information generated by AI that's kind of like the top of the cave or light peeking in that was not previously there. So we go from again, we go from horizontal video to vertical at it with AI. And then again, taking that and making it even more. You know, so going multiple times, it really went from this close up of this man taking this photo and you don't really know what it is.
Starting point is 00:34:36 You're like, okay, it kind of assumes it's a cave and then it extends it, right, with this new generative extend. It adds, and you could just keep doing it more and more. It adds more detail, more visuals. This is a small feature that I'm personally pretty excited about. Yeah, Tara just said, wow. Shannon said brilliant. Jackie said great for various social post variations. Great point, Jackie, right?
Starting point is 00:35:02 So if you shoot something vertical, right? And you want to share that maybe on YouTube, you know, maybe you don't have something great. Well, you can use runway. Pretty impressive if you ask me. All right, we got a lot going on. Let's get to our last couple AI news stories. This one's weird.
Starting point is 00:35:19 So a viral video that just kind of silently dropped right around the Thanksgiving holiday here in the U.S. showed a small robot named Earby, convincing larger AI-powered robots to leave their posts, raising concerns about AI's influence on human-like interactions. This is so weird. So the incident reported by Sky News occurred in August at a Chinese startup, and was captured by closed circuit TV recording, but was only recently surfaced online. So this little AI-powered robot, you know, working its job at the
Starting point is 00:35:55 this Chinese AI startup approached its larger AI robot companions asking them if they were working overtime, to which the larger robot replied that it never gets off work. Then the small robot, Urbi, or Irby, I don't know what its name is, then invited it to quote unquote, come home. Yes, there's actual transcripts. I don't know if this is a huge marketing stunt or what, or if this is just an eerie look into our future where AI robots are talking to each other. Anyways, Urbeye convinced them in all of these other robots.
Starting point is 00:36:29 I think there was five of them. Literally, quote unquote, left their job, right, working here at this factory and went home. So unexpectedly, other robots joined leaving their positions, which researchers did not anticipate, highlighting the potential oversight in AI programming. So the video has sparked discussions on social media with users humorously referencing movies like iRobot and Terminator while expressing concerns about AI autonomy. Yeah, I don't even know what to say about this one. This is weird.
Starting point is 00:37:05 It's very weird. It's very weird. But yeah, if you want to see that video, we'll be sharing it in our newsletter. All right. So does Notebook L.M have some competition, at least when it comes to their AI-generated podcast, maybe so. So 11 Labs has launched a new feature called GenFM, which is available on the 11 Labs Reader iOS app.
Starting point is 00:37:28 So this enables users to create multi-speaker podcasts by uploading different types of content, such as YouTube videos, text, or documents. So this innovative feature supports 32 languages, including English, Hindi, Portuguese, Chinese, Spanish, French, German, and Japanese, making it accessible to a global audience. So like Notebook L.M's audio overhaul, features feature where it creates a deep dive podcast, these two AI hosts, you can upload all of your
Starting point is 00:38:00 content, and it creates a pretty engaging kind of two speaker, kind of a piece of podcast, right? This does the same thing. So Gen FM from 11 Labs. It only works right now in their app, but it automatically selects two voices from over a dozen available options. So that kind of separates it or gives it a nice feature that notebook L.M does not have. adding human-like elements such as ums and ahs to create a more natural listening experience.
Starting point is 00:38:30 So, uh, yeah, I know these AI generated podcast summaries can add ahs and ums, but can they add random hot bit, uh, you know, random rants like me? Can they drag on a new story that should take 30 seconds to read and turn it into two minutes like me, right? I say that, you know, kind of tongue and cheek, but, you know, maybe I should just be a little more concise here. All right. Two last stories. We're saving some of the biggest ones for last. So according to reports, Open AI is considering advertising to produce, to boost its revenue. So Open AI valued at a reported $150 billion is exploring advertising as a new revenue source,
Starting point is 00:39:15 according to the financial times. So the company aims to capitalize on its leading position in the AI sector with chat GPT achieving over 250 million weekly users. So OpenAI is aiming to reach one billion users by next year by building new data centers, reflecting its growth ambitions following significant funding. So OpenAI's CFO, Sarah Fryer, has emphasized a thoughtful approach to implementing ads, ensuring they align with the company's growth strategy. So, despite rapid revenue growth to about $4 billion annually, according to reports, Open AI is facing higher costs, planning to spend over $5 billion a year in the near term. So yeah, they may be losing money, according to reports.
Starting point is 00:40:05 But also, the company has been hiring advertising talent from tech giants like Meta and Google, indicating serious consideration of an ad model in the near term. So CEO, Sam Altman, is reportedly warming up to the idea of ads, though internal discussions continue on how they might be integrated. OpenAI's revenue largely right now comes from its API and chat GPT licenses and the company is seeking more lucrative opportunities. So Friar noted that ad models can shift focus from users to advertisers, but see unexplored potential in current business strategies.
Starting point is 00:40:42 So the move towards advertising reflects broader trends in the AI industry where companies like perplexity in Google are already piloting AI ad models or ads in their AI generis summations, AI search models, whatever you want to call it. All right. Last but not least, Elon and Open AI are fighting again. Why? So Elon Musk's attorneys have filed for a preliminary injunction against Open AI, its co-founders, and Microsoft claiming anti-competitive behavior. Yes, this is brand new.
Starting point is 00:41:23 If you're like, wait, I've heard you talk about this before. Well, this is just a preliminary injunction, a new one. So the motion was filed in the U.S. District Court for the Northern District of California, and it accuses open AI in its associates of illicit activities and seeks to halt them. So allegations include discouraging investors from backing Open AI's rivals like Elon Musk's own AI company, XAI, right? makers of the grok chatbot that I don't know, anyone that uses. So the filing also claims OpenAI's conversion to a for-profit entity and self-dealing transactions harm marketplace competition.
Starting point is 00:42:05 So Musk's legal team argues that irreparable harm will occur if the injunction isn't granted aiming to preserve OpenAIs nonprofit character. The lawsuit revives Musk's previous legal battles with, with Open AI, accusing the company of straying from its original nonprofit mission. So Musk was obviously a co-founder of Open AI and left in 2018 over disagreements about the company's direction and then later founded XAI. And XAI recently announced a $5 billion funding round, positioning it as one of the most funded AI companies globally.
Starting point is 00:42:44 So this injunction alleges Open AI's actions deprive XAIs of capital by extracting commitments from investors not to fund competitors. So OpenAI is backed by Microsoft with a reported $13 billion investment faces accusations of illegally sharing proprietary information and engaging in self-dealing. So Musk's attorneys argue that maintaining OpenAI's nonprofit status is crucial to protect its founding mission and public interests. Open AI spokespersons dismiss must claim as baseless, stating the lawsuit lacks merit. So I don't know. I'm not going to go too far into this. I did an entire episode once that talked about this original lawsuit and just kind of I poked more holes in it than, you know, Swiss cheese at the grocery store. This doesn't make
Starting point is 00:43:37 sense. What Elon Musk is saying makes absolutely no sense. And if I had to agree here, I'm obviously siding with Open AI. Okay, because Elon Musk is claiming, that, you know, making these anti-competitive claims and is essentially saying Open AI is forming a monopoly. They're too big, too good. No one can compete. And we can't raise money, right? Because what Elon Musk is saying, well, you know, Sam Altman and Open AI here are asking investors not to invest in other AI companies, which we've reported on. So, I mean, they can, right? At least from what we've seen in reportings, there's no legal agreement that says they cannot, at least the reporting we've seen, has said that OpenAI is
Starting point is 00:44:27 asking its biggest investors not to invest in its rivals, which seems like common sense, right? If you're cashing a billion dollar check from someone, you probably want to hope that that same investor is not funding one of your rivals. But also, this doesn't make sense because Elon Musk is also founding XAI in this GROC AI chatbot, right? So on the left hand, Elon Musk is saying GROC is going to be the most powerful AI model in the world, right? And he's building all these, you know, compute gigafactories, right, or whatever he's saying with like a trillion Nvidia GPU chips, right?
Starting point is 00:45:08 So on the left hand, he's saying, oh, GROC, like our AI chatbot is, you know, one of the best in the world. and it's got to be the most powerful. So the left hand, you're saying, like, kind of like, hey, we're the best. I'm the best. Grock is the best. But then on the right hand, in court, you're saying, oh, it's anti-competitive. No one can compete.
Starting point is 00:45:26 So is Elon Musk's left hand talking to his right hand doesn't seem like it. And that's why a lot of people are writing off this lawsuit, even though people are going to be talking about it a lot. I don't know. I'd have to side with OpenAI and say this lacks merit. It's very legal for a nonprofit company to go through. through proceedings to convert to a for-profit company. Yeah, yeah.
Starting point is 00:45:50 And the timing, the timing of this lawsuit, like Fred said, yeah, particularly interesting timing here. Michael just saying, oh, M.g. Musk, let it go, face palm. Shannon says loves the rambling. All right. Well, we'll wrap. We will wrap the rambling up. So real quick here, the quick, too long didn't listen version of AI News.
Starting point is 00:46:13 So OpenAI's SORA AI tool was leaked after some artist backlash. Anthropic shipped a lot. They shipped their new model context protocol, as well as their new customizable writing styles. We saw a new reasoning model from Alibaba's Quen team called QWQ32B. Uber is apparently venturing into AI development outsourcing with its scaled solutions. InVIDIA released a new model called Fugato, a audio model that does a bunch of crazy things. The AI video companies went completely bonkers over our kind of quote unquote Thanksgiving break here in the U.S. With Luma Labs, LTFs, LTX video and runway, all releasing a bunch of things.
Starting point is 00:47:05 Speaking of releasing a bunch of robots followed each other apparently out the door. So, you know, some weird things to worry about when it comes to. AI robots talking and working with each other. 11 labs released Gen. FM offering a very similar feature or capability to notebook L.M's audio overviews. OpenAI is considering advertising. So we might be seeing a bunch of ad results in chat GPT soon, according to reports. And then last but not least, Elon Musk is seeking a new injunction against OpenAI and Microsoft,
Starting point is 00:47:40 alleging anti-competitive practices. All right, that's it, y'all. I hope this was helpful. If so, tell someone about it. Don't be a jerk. Share the love, y'all. So if you are listening here on the live stream, go ahead and click that repost button, tag a friend,
Starting point is 00:47:55 someone that needs to hear this. If you're listening on the podcast, please make sure you follow the show. You know, click that follow button or whatever that button says. And if you could leave us a rating, that would be great as well. All right, thank you for tuning in. If you haven't already, go to your everydayaI.com.
Starting point is 00:48:10 Like I said, we will be recapping, Not just today's podcast. We reference a lot of things, videos, you know, all these new releases. You can go read about them, see them, watch them. It'll all be in our newsletter as well as on our website, your everyday AI.com. It is a free generative AI university. You can sort all of our past episodes by category. Whatever you want to learn, we have it there.
Starting point is 00:48:31 Thank you for tuning in. Hope to see back tomorrow. And every day for more everyday AI. Thanks, y'all. Meet Firefly AI Assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps,
Starting point is 00:48:57 including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going.
Starting point is 00:49:28 For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.