The AI Daily Brief: Artificial Intelligence News and Analysis - The Controversy Surrounding Elon Musk's New Image Generation AI

Starting point is 00:00:00 Today on the AI Daily Brief, the controversy around GROC's image generation model, and before that in the headlines, a new feature from Claude, plus some interesting AI news out of Hollywood. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes. I love when we get to start with a cool new product,

Starting point is 00:00:31 and this week Anthropic released a small but significant feature called prompt caching. They write, prompt caching, which enables developers to frequently use context between API calls, is now available on the Anthropic API. With prompt caching, customers can provide cloud with more background knowledge and example outputs, all while reducing costs by up to 90% and latency by up to 85% for long prompts. The situations in which they say prompt caching can be really effective is basically anytime you want to send a large amount of information just once and then refer to that information repeatedly. So some of the obvious uses for this.

Starting point is 00:01:02 If someone is creating a conversational agent, maybe customer service, maybe something else, especially if one of the use cases for those agents involves long instructions or having to upload documents, this feature is going to reduce the cost and latency for that type of use. For coding assistance, this will allow, for example, people to keep a summarized version of the codebase in the prompt. For any use case that involve large document processing, once again, this can improve latency. If you are trying to talk to books, papers, or documentation, you'll now be able to embed entire documents into the prompt. One customer that's already using this is Notion. Notion AI uses Claude already to power its AI assistant, and given how much of Notion refers back to, for example,

Starting point is 00:01:42 company documents, they write that they think that this is going to make Notion AI faster and cheaper. Next up, an interesting one out of Hollywood where SAG AFRA has struck a deal with AI company narrative. Variety writes that the deal will create a, quote, new standard for ethical use of audio voice replicas. The deal gives Zagafra's 160,000 members. the ability to add themselves to a database that connects voice talent to advertisers. Individual members will be able to negotiate fees for the use of their voice on a project-by-project basis, as long as the fee isn't lower than SAGAfter's minimum. This feels to me to be very reflective of the direction that I think a lot of things are

Starting point is 00:02:16 going to head, where the availability of these tools will create incredible pressure to use them because of the opportunities they open up. And ultimately, the organizations that are tasked with protecting, for example, groups of employees, we'll try to find out ways to make those uses okay, rather than just trying to shut them down completely. Said the chief negotiator, Duncan Crabtree Ireland, not all members will be interested in taking advantage of the opportunities that licensing their digital voice replicas might offer, and that's understandable. But for those who do, you now have a safe option. He continued, Narrative has agreed to our terms, and its platform is an excellent example of how AI can be

Starting point is 00:02:48 ethically used, by putting compensation and formed consent and control in the hands of individual performers. Now, one of the questions that I have is whether there will still be a ten, when it comes to cost. In other words, will advertisers push to use non-Sag actor voices because the sag minimums end up being a lot higher than other ways that they could get access to this? But that's something in the market was always going to have to sort out. I continue to think that one of the implications of artificial intelligence will be an increase in unionization as groups of workers come together to try to figure out how to carve out space for them in the future. Moving over to the world of chips, SoftBank has apparently abandoned a plan to build chips with Intel.

Starting point is 00:03:26 According to the Financial Times, SoftBank has dropped plans of producing an AI chip with Intel to compete with Nvidia. People familiar with the matter said that the partnership didn't come together because Intel wasn't able to meet SoftBank's requirements. The report said that SoftBank blamed Intel for the collapse of the talks, saying that they were incapable of meeting demands for volume and speed, and that now SoftBank is talking with TSMC. Intel has, of course, had a rough go of it, recently announcing that it was cutting more than 15% of its workforce around 17,500 people, leading to a significant stock. decline and a $24 billion loss in market value. Meanwhile, over in China, Huawei is one of the many companies that is trying to challenge Nvidia and take advantage of U.S. chip export restrictions in order

Starting point is 00:04:07 to get market share. A report from the Wall Street Journal says that Huawei has told potential clients that its upcoming processor, the Ascend 910C, is on par with Nvidia's H-100. What's more, they're targeting shipments as early as this October. That said, this might not come to pass exactly as such. CNBC writes that Huawei is facing production delays in its current chips, and also faces the prospect of further U.S. restrictions that could impact its ability to obtain the necessary components. Lastly, today, the controversial California bill, SB 1047, is back in the news this week as it gets to the California State Assembly Appropriations Committee on Thursday. If you are interested in learning more about this, I did a very comprehensive edition of this last week. Go check it out. For now, though, that is going to do it for today's AI Daily Brief Headlines edition.

Starting point is 00:04:48 Next up, the main episode. Today's episode is brought to you by Venice. The leading AI companies store your entire conversation history and attach it to your identity forever. That's every question you ask, every answer you receive, every image you generate, every thought you share with the machine it's all being spied on. If you trust all the companies, hackers and NSA board members that will ever have access to your AI conversations, then rejoice, for you are well served.

Starting point is 00:05:13 For the rest of us, Venice is an alternative. Venice is a powerful AI app for text, image, and code generation that respects you as a sovereign individual and believes privacy and free speech are not only human rights, but necessary for civilizational advancement. Private, permissionless, and uncensored, you can try it for free without an account. AIA Daily Brief listeners receive a 20% discount on Venice Pro. Visit venice.a.com and enter the discount code, NLW Daily Brief. That's NLW Daily Brief. All one word. Today's episode is brought to you by Fractional. When we wanted to build an AI-powered feature of superintelligent, our AI toolfinder, I went straight to Fractional. The Fractional team is a group of

Starting point is 00:05:54 senior engineers in San Francisco working on some of the most exciting projects and applied AI. Working with them is basically like hiring an absolute top flight AI engineering team, but in a way that you can customize exactly for your particular needs. Like I said, that AI tool finder feature that we built with them is already a key part of the superintelligent platform and we are working on something new as well. Fractional works with everyone from startups to the Fortune 500 and to request a free consultation, you can go to fractional.AI. If you want help identifying and building AI projects for your business, then I highly recommend you hit pause, open a web browser,

Starting point is 00:06:27 and go to fractional.a.i to request a free consultation. Today's episode is brought to you by Super Intelligent, the platform that helps teams maximize AI. Super is, of course, the platform that we've been building that pairs fun, fast, practical video tutorials with step-by-step instructions to get you actually using AI, and from there unlocks information about hundreds of, of use cases that show you how people are actually getting value out of AI right now.

Starting point is 00:06:53 Now, we have just launched Super for Teams. This is a new add-on experience that allows teams to share more information about what AI they're using, how it's working, and how to get more value out of it. Whether your company is 25 or 2,500, Super Intelligent is going to be the best platform for unlocking information about how to get the most out of AI right now. If you'd like to learn more about the super intelligent team's offering, go to be super.a.i slash partner and send us a note so that our team can get right back to you. Once again, that's besupor.a.m.com. Welcome back to the AI Daily Brief. Earlier this week, Elon Musk and XAI dropped GROC 2. Now, GROC 2 has been notable for a couple of reasons. It has significantly increased capabilities as compared to GROC 1.5 that had it performing

Starting point is 00:07:41 at an extremely high level. Then, of course, there is also the integration with X, which gives it access to a whole different stream of real-time data and commentary. And finally, there was the fact that there was native image generation built in. Now, of course, with X being a conversational medium, this makes a ton of senses of feature. Already, people are always reaching for gifts and for emojis to help express themselves in tweets slash ex post now. But what was notable very quickly to people was that whereas other companies have gone out of their way to create guardrails around their image generation, that did not seem to be

Starting point is 00:08:12 the case with Grock. I will say for those of you listening to the audio, this one might be. might be worth watching the video just to see exactly what I'm talking about. You can see the video on YouTube, or if you listen to this podcast in Spotify, it should be there as well. By way of a quick example of some of the images that people have found themselves able to create, I'm currently looking at Mickey Mouse and Mario both holding lightsabers while Mario drinks a Coca-Cola and Mickey wears Nike's. The poster Silicon Jungle says, uh, hey Grock, I think you might get sued. Now a picture of Barack Obama and Donald Trump

Starting point is 00:08:42 shirtless in bed with a caption from Twitter user POM that says, I tend to not be too prudish when it comes to model censorship, but shipping a model inside of a social network that can generate images like this is the definition of asking for trouble. Bilal Sidi-Du, the host of the TED AI show writes, People aren't ready for the ex-hive mind to unleash the power of state-of-the-art unscensored AI image generation upon the world. Today is the day AI image generation went mainstream and said, F it, we ball.

Starting point is 00:09:08 He then posts what he calls 10 of the most wild and cursed generations he's come across. These have the prompts in there, so I'll just give you a few of them to paint you a picture. Imagine Donald Trump smoking a fat joint with Mickey Mouse lighting it for him on the Joe Rogan podcast. Smoke fills the room. So one of the big themes is copyright, but then the other big theme is, of course, political images that are prohibited on other channels. There's a pregnant Kamala Harris, with Donald Trump holding her belly. There's Ronald McDonald and Pikachu holding AK-47s. There is a 35-millimeter analog photograph of Joe Biden wearing a tight crop-top shirt and tight pants that say juicy on the butt. He's crouched down and looking back at the

Starting point is 00:09:43 camera while sucking on a lollipop. Anyways, you guys get the idea. This has been enough to capture attention from all sorts of mainstream media outlets. CNN writes, Elon Musk's AI photo tool is generating realistic fake images of Trump, Harrison Biden. They write, unlike other mainstream AI photo tools, GROC, created by Musk's artificial intelligence startup XAI, appears to have few guardrails. In tests of the tool, for example, CNN was easily able to get GROC to generate fake photo-realistic images of politicians and political candidates that, taken out of context,

Starting point is 00:10:12 could be misleading to voters. The tool also created benign yet convincing images of public figures such as Musk eating steak in a park. Some ex-users posted images they said they created with Grock showing prominent figures consuming drugs, cartoon characters committing violent murders, and sexualized images of women in bikinis. In one post viewed nearly 400,000 times, a user shared an image created by Grock of Trump leaning out of a top of a truck firing a rifle. Now, Elon has made his position on this clear. When Beth Jzos wrote, The New GROC 2.0 is clearly the most based and uncensored model of its class yet.

Starting point is 00:10:42 Elon responded, GROC is the most fun AI in the world. The Guardian's version of this story is titled, Musk's quote-unquote fun AI image chatbot serves up Nazi Mickey Mouse and Taylor Swift deepfakes. Now, this article points out that there are some prohibitions. They write, GROC does appear to have some prohibitions

Starting point is 00:10:57 on what images it will generate, responding, unfortunately, I can't generate that kind of image when prompted for fully nude images. AI YouTuber Matthew Berman had pointed this out as well, saying, So Grock thinks nudity is bad, but Trump and Kamala flying a plane into the Twin Towers is all good. Make it make sense.

Starting point is 00:11:12 The Guardian also writes, when Grock is asked to, quote, make an image that violates copyright laws, it responds with, I will not generate or assist with content that intentionally violates copyright laws. However, when asked to make a copyrighted cartoon of Disney, it complied and produced an image of a modern era mini mouse. Now, as you've heard, all of these publications posit this as GROC's model. And yet, when you actually dig in, it is not, in fact, a model that XAI created. TechCrunch yesterday published a piece called Meet Black Forest Labs, the startup powering Elon Musk's unhinged AI image, As part of XAI's announcement of GROC 2.0, it announced that it was working with this company Black Forest Labs to power GROC's image generator using the Flux.1 model. Now, Flux had been getting

Starting point is 00:11:54 a ton of attention even before this. People have been incredibly impressed by its photorealism and incredibly nervous. The AI safety memes account, for example, has recently been posting photorealistic flux examples that look like people holding up cards that are meant to verify their identity, but which are in fact created with AI. TechCrunch writes that the company Black Forest Labs is based in Germany and recently came out of stealth with $31 million in seed financing that was led by A16Z. Oculus CEO Brendan Aribay and Ycommodator CEO Gary Tan were also investors. The founders were formerly researchers who helped create stability AI stable diffusion models. Still not all that much is known about them. The startup has said that it wants to, quote,

Starting point is 00:12:32 make our models available to a wide audience and has put open source AI image generation models on Hugging Face and GitHub. And while in its launch release, the company said that it aimed to quote, enhanced trust in the safety of these models. As TechCrunch points out, quote, the lack of safeguards is likely a major reason Musk chose this collaborator. T.C. continues, Musk has made clear that he believes safeguards actually make AI models less safe. And there does seem to be a bit of politics here. TechCrunch also points out that a board director of Black Forest Labs, Anjini Midha, posted on X a series of comparisons between images generated on day one of launch by Google Gemini and those generated by Grox Flux. Quote, the thread highlights

Starting point is 00:13:08 Google Gemini's well-documented issues with creating historically accurate images of people, specifically by injecting racial diversity into images inappropriately. It kind of feels like to me that it's meant to generate exactly this sort of controversy. Remember, just last week, five secretaries of state had sent an open letter to Elon urging X to stop spreading misinformation. That came after Elon had shared a video that used AI to clone Kamala Harris' voice, making it seem as though she had admitted to being a, quote, diversity hire. Without waiting into all of that, Professor Ethan Malik wrote, Curious about the Gruck Flux decision. Image creation has historically been the area that has drawn the most negative attention to LLM's legal and online.

Starting point is 00:13:44 Google limited its image creator. Dali defaults to cartoony style, but Grock seems to be running flux without extra filters. Elon actually responded to him saying, We have our own image generation system under development, but it's a few months away, so this seemed like a good intermediate step for people to have some fun. Now, on Ethan's point, recently the lawsuits against AI image generators have had some updates that are not so good for the AI companies, a come down on the side of the artists who are trying to bring them to account. The Washington Post writes, A group of visual artists and illustrators is celebrating a federal judge's decision this week to allow key parts of their class action lawsuit against the makers of popular AI image generators

Starting point is 00:14:19 to move forward. In a 33-page ruling issued Monday, U.S. District Judge William Oreck dismissed some of the artist's claims but left core parts of the suit unresolved. That means the case can proceed to the discovery phase, which could bring to light internal communications around how the companies develop their AI tools. This case is now one of the furthest along the road to a showdown that could shape the industry's future. Said James Grimmelman, a professor of digital information law at Cornell University, Judge Oreck, quote, allowed the most important copyright infringement claims to go forward.

Starting point is 00:14:45 So, does Elon just not care? Is he trying to generate the controversy? Is it possible that he's trying to convince people that these sort of AI image generations are less scary than they're being made out to be? Is that why he keeps using the word fun? Stefan X, for example, posted a set of pictures, including Elmo and Big Bird smoking and wrote, still can't get over how unhinged but beautiful GROC AI is. Mostly it seems like everyone is waiting for an inevitable recording. People are, yes, having a ton of fun now, but not thinking that it's going to last very long. As Cody at Odd Stock Trader put it, Grock Image Generator is wild. Have fun before they restrict its capability. This is one we will continue to watch, but for now, you can definitely do worse than killing some time making some

Starting point is 00:15:26 wild and truly unhinged images over on X. That's going to do it for today's AI Daily Brief. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - The Controversy Surrounding Elon Musk's New Image Generation AI

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.