The AI Daily Brief: Artificial Intelligence News and Analysis - A New OpenAI Model Coming?

Episode Date: August 13, 2024

Rumors are swirling around OpenAI’s potential release of a new advanced reasoning model, codenamed “Strawberry” or “Q-Star.” Speculation is fueled by tweets from an anonymous account claimin...g insider knowledge, as well as cryptic hints from OpenAI’s own team. Plus an epic future of search team up between Perplexity and Polymarket.

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Daily Brief, is OpenAI's more capable reasoning model coming tomorrow? Before then in the headlines, perplexity and polymarket come together for an epic future of search team-up. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes. We kick off today around one of the themes that we've been exploring a lot lately, which is the future of search. It seems like there is a big bet that in a world newly enabled by generative AI, search is not just going to be a set of links like it is now with Google and like it has been for 20 years with
Starting point is 00:00:43 Google, but instead a combination of that and some amount of curatorial synthesis that gives you answers in complete written-up form. However, a new partnership from Perplexity and Polymarket shows that going forward search may also include other types of data in new ways. They tweeted earlier today, we're thrilled to announce our partnership with Polymarket. Now when you search for events on perplexity, you'll see news summaries paired with real-time probability predictions, such as election outcomes, market trends, and beyond. So Polymarket, for those of you who don't know, is a prediction marketplace that lets people bet on real-world events. It is the latest type of technology platform to draw the ire of Elizabeth Warren, and it is now going to be integrated into perplexity.
Starting point is 00:01:24 TechCrunch writes, when users click on an event on Polymarket, they will now see a summary of news related to the event based on search results from Perplexity. There's also a search box that you can use to ask more questions. Perplexity will also use some data from polymarket, such as election trends, to show visuals and answers. Now, interestingly, in this news story, there's also a little bit of insider baseball when it comes to Perplexity's business model. The chief business officer over at Perplexity told TechCrunch that, quote, the API business is not a priority for us as we are a consumer-focused company. But still, API usage is growing with more than 25,000 developers using our API. We have a unique offering that pulls in answers via different internet sources. The way we think about
Starting point is 00:02:00 API right now is that it's a means to grow the brand, but not the end. Could we see a perplexity that diversifies its consumer business model with a more B2B model through their API in the future? Certainly doesn't seem insane. Shane Copeland, the CEO of Polymarket, said, when I first built Polymarket, I would tell people I was building the search engine for the future. Now that's coming to life. Next up, another little team up to discuss. Invitya is collaborating with the state of California. to train California residents on AI. The partnership between the state and the company is aiming to help train 100,000 students,
Starting point is 00:02:33 college, faculty, developers, and data scientists to harness AI. Business Insider writes that as part of that training, educators in California can get a certification from the Nvidia Deep Learning Institute University Ambassador program. But of course, as much as this is about Nvidia making a bet on the state of California, it also comes at a time that California's relationship with AI is at an inflection point due to SB 1047.
Starting point is 00:02:53 If you want the background on what's going on that, go check out the episode that I dropped last week. I tried to provide a pretty broad overview without too much editorialization, although of course I couldn't avoid it entirely. But to those who are against it, they think that it could have major implications for whether businesses actually stick around to do business in California when it comes to AI. To more California companies in our next story, TechCrunch reports that Meta and Universal Music group have updated their licensing agreement and in it started to deal with AI music. TechCrunch again writes, Meta and UMG announced on Monday the expansion of their multi-year licensing agreement,
Starting point is 00:03:27 which enables users to share songs from UMG's music library across Meta's platforms without violating copyright. What's most notable about the new agreement is that it states that the two companies are addressing unauthorized AI-generated content. This refers to songs being scraped by AI systems, often without the consent of the original creators. Said Michael Nash, chief digital officer and EVP at UMG, we look forward to continuing to work together to address unauthorized AI-generated content that could affect artists and songwriters so that UMG can continue to protect their rights both now and in the future. Meta is treading carefully when it comes to AI-generated music, as TC writes, they're only releasing generative AI models, including audio craft music gen, and
Starting point is 00:04:02 Jasko that are trained with, quote, meta-owned and specifically licensed music. Now, I continue to think that markets are going to address this particular issue and that the record labels are going to do what they always do, which is swoop in to get a big cut of the new area, but exactly how that plays out we haven't yet seen. Lastly today, two updates from big tech companies in their AI strategy. analysts believe that Apple could charge up to $20 for some of its Apple intelligence features. Now, of course, Apple intelligence is being delayed slightly from when it was anticipated rolling out with iOS 18, but now analysts are speculating that these features are not going to be for free. Will that impact how much mainstreaming they actually do? Or is Apple such a
Starting point is 00:04:39 behemoth that even with a subscription model, they're still likely to be able to bring on a whole new raft of users? Finally, and yet another indication of how hard it is to be a startup creating AI features, Google Meet has announced a new Take Notes for Me feature, which seemingly imitates the functionality of a ton of startups out there who are AI note-taking apps for things like Google Meet and Zoom, but natively within the Google workspace. So is this a firefly slash otter killer? Does it just mean that all AI startups are destined to be replaced by functionality from the big tech companies? Only time will tell. That's going to do it for today's AI Daily Brief Headlines edition. Next up, the main episode. Today's episode is brought to you by
Starting point is 00:05:17 Venice. The leading AI companies store your entire conversation history and attach it to your identity forever. That's every question you ask, every answer you receive, every image you generate, every thought you share with the machine it's all being spied on. If you trust all the company's hackers and NSA board members that will ever have access to your AI conversations, then rejoice, for you are well-served. For the rest of us, Venice is an alternative. Venice is a powerful AI app for text, image, and code generation that respects you as a sovereign individual and believes privacy and free speech are not only human rights but necessary for civilizational advancement. Private, permissionless, and uncensored, you can try it for free without an account. AIA Daily Brief listeners receive a 20% discount on Venice Pro.
Starting point is 00:05:57 Visit venus.a. slash NLW and enter the discount code, NLW Daily Brief. That's NLW Daily Brief. All one word. Today's episode is brought to you by Fractional. When we wanted to build an AI-powered feature of Superintelligent, our AI tool finder, I went straight to, to Fractional. The Fractional team is a group of senior engineers in San Francisco working on some of the most exciting projects in applied AI. Working with them is basically like hiring an absolute top flight AI engineering team, but in a way that you can customize exactly for your particular needs. Like I said, that AI tool finder feature that we built with them is already a key part of the superintelligent platform and we are working on something new as well. Fractional works with everyone
Starting point is 00:06:39 from startups to the Fortune 500 and to request a free consultation, you can go to fractional.aI. If you want help identifying and building AI projects for your business, then I highly recommend you hit pause, open a web browser, and go to fractional.aI to request a free consultation. Today's episode is brought to you by Super Intelligent, the platform that helps teams maximize AI. Super is, of course, the platform that we've been building that pairs fun, fast, practical video tutorials with step-by-step instructions to get you actually using AI. And from there, unlocks information about hundreds of use cases that show you how people, are actually getting value out of AI right now. Now, we have just launched Super for Teams. This is a new add-on experience that allows teams to share more information about what AI
Starting point is 00:07:26 they're using, how it's working, and how to get more value out of it. Whether your company is 25 or 2,500, superintelligent is going to be the best platform for unlocking information about how to get the most out of AI right now. If you'd like to learn more about the superintelligent teams offering, go to BESuper.a.I. slash partner and send us a note so that our team can get right back to you. Once again, that's B-super.a.ai slash partner. Welcome back to the AI Daily Brief. We started talking about this last week, but boy, has the hype and rumor train around OpenAI's next model gotten nothing but louder since then. The main character in this particular drama is a Twitter account called at I Rule the World M.O.
Starting point is 00:08:09 This person or perhaps AI has been tweeting nonstop for a couple of weeks next to with increasingly specific predictions, interspersed with big, broad, grand vision tweets like this one. Imagine a world where every human has access to the power of a thousand minds. That's the promise of Project Strawberry, and it's closer than you think. So today we're checking in on the rumors, which of course could just be rumors and everything we know so far. So by way of background, Strawberry is apparently the latest name for what was originally called Q-Star. You might remember Q-Star from around the time that Sam Altman was fired from OpenAI, before being rehired, there were tons of speculation that this more advanced model had freaked some
Starting point is 00:08:49 people out internally, and they were concerned around how Sam Altman was likely to go try to commercialize it, and that was the motivation for his firing. Now, subsequently, no one has tried to make that claim, but try to tell that to the internet denizens. Then back in July, Reuters reported that the new name for Q Star was called Strawberry, and that yes, indeed, it was about a better reasoning model. All of this further heated up last week, when a new model showed up on the Limsys chatbot arena, which was called Anonymous Chatbot. Now, OpenAI has a history of releasing their models as this sort of anonymous chatbot
Starting point is 00:09:21 in advance of them being officially released until people were looking for evidence that this might be Q Star or now Strawberry. writes NDTV, Anonymous chatbot uses chain of thought reasoning, a stark step away from the current methods of reasoning like pattern recognition. If new chatbots are going to start showing up that use chain of thought and their reasoning capabilities
Starting point is 00:09:37 were likely to see another shift in the AI race. Then, of course, there's the fact that OpenAI's team seems to be part of the team. Sam Altman tweeted, I'd love summer in the garden with a picture of some strawberries. And when that same anonymous user I ruled the world wrote, welcome to level two. How do you feel? Referring to Level 2 on OpenAI's new chart of the path to AGI, which is reasoning level, Altman replied to that account, Amazing, TBH.
Starting point is 00:10:02 As Tom's guide points out, however, there's another reason to think that maybe OpenAI would do this this week. And that is that there is a Google event this week, made by Google, which happens tomorrow on Tuesday, August 13th. Tom's writes that the event has very few unknowns when it comes to what Google will likely announce, given that they've already shown off the Pixel 9 Pro and Pixel 9 Pro fold in videos, and so the speculation is whether the focus comes back to AI. As Tom's points out, Google has lost some of their steam when it comes to AI on the phone, and certainly as part of the larger generative AI conversation, and speculates around what some of the AI features that they might announce instead would be. So is Open AI just going to try to cut the media legs out from under this
Starting point is 00:10:40 Google event? Well, that same anonymous account, I Rule the World, has said that tomorrow at 10 a.m. Pacific Time is when some big announcement from OpenAI will be made. This morning, they wrote, attention isn't all you need. New architecture announcement, August 13th at 10 a.m. Pacific Time, the singularity begins. A couple days before that, that account had given more of what it argued was its insider information. On Project Strawberry slash Q Star, they said, this is what Ilya saw. It's what has broken math benchmarks. It's more akin to RLHF than throwing compute at the problem. Think of it as an LLM fine-tuned to reason like a human. This account also argues that GPT Next, which they say is internally called GPTX, but what we would call GPD5 is, quote, also ready to go.
Starting point is 00:11:22 Lots here relies on safety and what Google does next. It's difficult to say if competition will trump safety. Though red teaming is finished and post-training is done, this model is such an enormous leap in capabilities that's becoming impossible to make the model safe. If you have this particular model unlocked, you could easily disrupt the world on an unprecedented scale. When you mix in voice, SORA agents in the eye-watering capabilities, things heat up. They'll get the safety right and then they'll roll it out, I'm sure. They also say that in addition to it just being really expensive, those safety concerns are also delaying SORA. Ultimately, they say, we will get to a step change next week. It won't quite be GPT5. GPD 5. GpT.5 slash next slash X is more comparable to the jump
Starting point is 00:11:58 made from GPT1 to 4. This is why Sam feels great. Ilya was right. You can scale your way to a digital god with or without strawberries, but strawberries plus scale will cure world problems overnight. Now, there is more than a little skepticism out there. Google's Francois Chalet writes, there have been AGI achieved internally rumors spread by OpenAI every few weeks or months since late 2022, and you guys are still eating it up for the endth time. If you were actually close to AGII, you wouldn't spend your time shit posting on Twitter. Logan Kilpatrick, formerly of OpenAI now at Google, writes, anyone who thinks I rule the world MO is AGI or some advanced model has no idea how OpenAI operates as a company. News outlets are now trying to
Starting point is 00:12:37 dig into who this anonymous account is. James Poulos at Blaze writes, text most mysterious and controversial new account followed me on X. Is it the Antichrist? Is it a bot? Is it a person the internet can't say for sure? And techies are going nuts arguing over it all. Meet I rule the world M.O. An out of nowhere account that almost immediately drew replies from the likes of Sale Maltman, due to its seemingly acute insider knowledge of impending breakthroughs and so-called artificial general intelligence. The neuron pointed out that it was far from conclusive that the anonymous chatbot on Limsys chatbot arena was actually strawberry or even particularly good. As they write, reviews are mixed. Some see it as a
Starting point is 00:13:11 potential strawberry drop with improved logic. Others think it's a minor upgrade to chat GPT4O with little evidence of reasoning. Sometimes it performs better than other models. Sometimes testers prefer others. Some commenters even think chat GPT may have already received a secret upgrade. And indeed you see that if you float around on Twitter slash X as well. Hyperite CEO Matt Schumer writes, something might be going on with GPT40. For the first time, in a long time, it provided better vibes on an output than 3.5 Sonnet. Really surprised, we'll keep using it today to see if it continues. That's from earlier today Monday, August 12th. Maybe even more interesting were these tweets from Perplexity CEO Aravan Shrinivas that seemed to imply that they're running Strawberry slash Q Star. Again from Monday when
Starting point is 00:13:52 someone tweeted, Perplexity Pro is already running the strawberry. Arvon quote tweeted it with the eyes emoji and three strawberries. On August 11th, he posted a screenshot of a perplexity query around the question, how many R's in there are many strawberries in a sentence that's about strawberries? He added the caption, Guess What Model This Is? Again, Strawberry Emoji. He also had the classic psycho-entrepreneur phone battery life of 6%. As the AI safety memes account pointed out, Arvund is also tweeting out various examples purporting to show strawberry in action. One of them from a day ago, the berries are scheming, is the caption. Even for OpenAI, which does a lot of hinting and likes to release things a little bit in advance of saying that they're releasing them,
Starting point is 00:14:31 this is a lot of hinting. At this stage, while I don't have a ton of sense of who's behind this meme account and don't want to give it too much credence, it seems highly unlikely that we're not going to see something this week. At the same time, I also think it's pretty unlikely that most people won't be disappointed based on just how hyped it has gotten. However, as soon as it comes out, whatever it is, you will of course hear about it first. Until then, appreciate you listening or watching as always. And until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.