The AI Daily Brief: Artificial Intelligence News and Analysis - ChatGPT Launched 1 Year Ago Today: A Retrospective

Starting point is 00:00:00 Today on the AI breakdown, we're doing a one-year retrospective of ChatGPT and OpenAI on ChatGPT's first birthday. The AI Breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube, our Discord, and our newsletter. Welcome back to the AI breakdown. Today we are doing something just a little bit different. In many ways, this is one of the more significant moments in the young history of the generative AI space, because on this day one year ago, OpenAI released fairly innocuously and without much fanfare a little chatbot they called ChatGBT.T. The blog post they used to release it was not particularly

Starting point is 00:00:45 bombastic. They write, we've trained a model called ChatGBT, BT which interacts in a conversational way. The dialogue format makes it possible for ChatGBTT to answer follow-up questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests. They said we're excited to introduce ChatGBTT to get users feedback and learn about its strengths and weaknesses. The limitations they give as chatchipt sometimes writing plausible sounding but incorrect or nonsensical answers. ChatchapT being sensitive to tweaks in the input phrasing or prompting. The language being excessively verbose in overusing certain phrases. The model not asking clarifying questions but instead guessing at what users intend, and of course the ability for sometimes inappropriate questions

Starting point is 00:01:24 or prompts to slip through. Now like I said, when the OPDI team released this tool, they did not expect it to blow up the way that it did. However, almost immediately people were incredibly excited. Back in June of 2022, Sam Altman had tweeted about how OpenAI products had been increasing in the speed with which they got to one million signups. As he wrote, took GPT3 24 months to get there, co-pilot around six months, and Dali only 2.5 months. Well, as Greg Brockman pointed out on December 5th of last year, chat GPT crossed one million users in just five days. And that would certainly not be the end of it. Almost immediately we started getting articles like this one from the New York Times, the brilliance and weirdness of ChatGBT.B.T. A new chatbot from OpenAI is

Starting point is 00:02:06 inspiring awe, fear, stunts, and attempts to circumvent its guardrails. Wrote Kevin Roos at the time, chattybety is quite simply the best artificial intelligence chatbot ever released to the general public. For most of the past decade, he writes, AI chatbots have been terrible, impressive only if you cherry-pick the bot's best responses and throw out the rest. In recent years, a few AI tools have gotten good at doing narrow and well-defined tasks like writing marketing copy, but they still tend to flail when taken outside their comfort zones. But chat chabit feels different. Smarter, weirder, more flexible. It can write jokes, some of which are actually funny, working computer code and college-level essays. It can also guess at medical

Starting point is 00:02:42 diagnoses, create text-based Harry Potter games, and explain scientific concepts at multiple levels of difficulty. The potential societal implications of chat chit he writes are too big to fit into one column. Maybe this is, as some commentators have posited, the beginning of the end of all white-collar knowledge work and a precursor to mass unemployment. Maybe it's just a nifty tool that will be mostly used by students, Twitter jokesters, and customer service departments until it's usurped by something bigger and better. Personally, I'm still trying to wrap my head around the fact that Chat GPT isn't even OpenAI's best AI model. That would be GPT4, the next incarnation of the company's large language model, which is rumored to be coming out sometime next year. We are not ready.

Starting point is 00:03:18 Well, apparently the world was ready, or at least it wanted to be, because by the end of January, chat GPT had become the fastest growing startup in history. It reached 100 million active users, something like six to eight weeks after being launched. The previous fastest product to reach 100 million users was TikTok and that took nine months. Instagram, it took two and a half years. Now, Open AI was not alone in all of this. They had a big power player partner in Microsoft and at the end of January, they announced that their deal was deepening even farther.

Starting point is 00:03:48 After investing in both 2019 and 2021, Microsoft would be investing another $10 billion, giving Open AI the capital it needed to compete in an extremely resource-hungry. environment. Throughout the spring, OpenAI added capacities to ChatGPT and made it more useful for developers. At the beginning of March, they introduced APIs for both ChatGPT and Whisper, Whisper being their extremely advanced speech-to-text model which makes Siri look juvenile at best. And then, of course, on March 13th, came GPT4. GPT4 was obviously significantly better than GPT 3.5 that we had been using with ChatGPT, but what we didn't realize at the time was that even in spite of an incredible amount of effort, an AI

Starting point is 00:04:28 technological arms race coming from all sides of the technology industry, GPT4 would remain the undisputed state-of-the-art throughout 2023. It remains so today, leading some to recently wonder what it is about GPT4 that's made it such an impossible barrier for other models to breach. At the end of that same month in March, chat GPT started supporting plugins. Plugins were a way for chat GPT to interact with other outside information and applications. Remember at this time, chatGPT wasn't natively connected to the internet. That wouldn't come until a little bit later. And so plugins were ways for all the various applications out there, as well as many new ones, to interact with chat GPT in a bunch of different ways. For example, the XPapers plugin allowed

Starting point is 00:05:11 chatypte to go browse through archives database of research papers, making it extremely useful for content creators who have to summarize dense and abstract artificial intelligence research with some frequency. Now, plugins may never have fully taken off. It was always kind of a rough experience, there was never really a great browse feature, and of course, custom GPTs, which would come much later, would kind of open up the question of whether they were really valuable in the first place. What's more, as Sam Altman said even at the time, it seemed like while a lot of developers thought they wanted their applications inside ChatGPT, what they probably wanted was chat GPT inside their applications. Still around March and April, we started to see the peak of

Starting point is 00:05:48 excitement and hype around ChatGPT. GPT4 was out, plugins had been released, and one of the plugins that OpenAI themselves were working on was completely taking over Twitter slash X, and that was called Code Interpreter. On April 29, John Backis, one of the lucky few who had access to it at that time, wrote, the code interpreter feature on ChatGPT is the most mind-blowing thing I've seen yet. All I did was upload a CSV of SF crime data and ask it to visualize trends, which Code Interpreter, by the way, did with aplomb. Now, when Code Interpreter rolled out a little more fully later that summer, many people said that it functionally turned GPT4 into GPT4.5. The reason for that is that by giving ChatGBTGPT the ability to write code, it created a whole new

Starting point is 00:06:29 pathway to solve problems that it previously had been unable to solve. That sort of step-function change in what it could do felt to many like a difference in kind, not just a difference in scale. However, for our journey through ChatGPT and OpenAI's first big year, we're back in May, when OpenAI CEO Sam Altman was the star witness at the Senate's first AI hearing in the post-Chat CBT world. Now, at that hearing, Altman articulated the position that he's large, largely had in every interview, both before and since, which is that although these technologies were likely to do net good, there were some serious risks, including an up to an existential risk, and that because of that, the U.S. did need to put in society and state-level guardrails.

Starting point is 00:07:07 This, of course, led to accusations of OpenAI and Sam Altman trying to promote regulatory capture, in which the introduction of new regulations and the difficulty of compliance cements the lead of companies who are already in a poll position and blocks out the ability for others to compete going forward. Meanwhile, right around the same time that Sam was testifying chat GPT was getting even more useful by moving to mobile. Part of the change there was that OpenAI's whisper speech to text made it actually a super useful experience when on the go. I used it, for example, as a live tour guide and art historian when I was traveling in London and Paris over the summer. Speaking of traveling, much of

Starting point is 00:07:41 Sam Altman's May and June were spent crisscrossing the globe talking to everyone from open AI developers to heads of states. Indeed, Altman went on a 22 country tour, with the goal to, as they put it, spend significant time directly engaging people who are interacting with and who are affected by our technology. Now, in their summary blog post, they wrote things like, their users and developers are already building valuable applications. There are common hopes and concerns for AI's impact among communities. Policymakers everywhere are deeply engaged on AI and that people want to know more about OpenAI's core values. One really interesting moment for there came from a blog post that was quickly deleted, though was basically the summary of what was

Starting point is 00:08:18 supposed to be an off-the-record conversation with developers in London, in which, among other things, Sam Altman talked about how the AI chip shortage was meaningfully and materially slowing the company down. They had wanted to release a multimodal version of ChatGPT that year, but they weren't sure they were going to be able to because of the chip shortage just as one, for example. Now, like I said, that post came down pretty quickly, but of course, the leaks had already been digested by the broader public. Now, at the same time this was happening, the narrative had started to shift a little bit around not just chat GPT, but around generative AI more broadly. One of the biggest catalyst for that was when Simple Web came out with statistics that

Starting point is 00:08:53 suggested that Chatchabit's usage had actually gone down between May and June, the first month where that had happened. My point at the time, which I think history has validated, was that one, given that we didn't have any precedent for an application that grows to 100 million users in six weeks, we don't exactly know if it's actually normal to have some amount of a cool-off period, that two, school being over would have a material impact, given how acutely there is a product market fit between ChatGPT and LLMs in general, and a wide variety of educational work, and three, more than anything else, after six months of totally fawning coverage, the media was just ready for a new narrative and a different story. Summer was comparatively quiet. One announcement that

Starting point is 00:09:33 hinted that the future was custom instructions, in which users were able to specify a little bit more information about who they were and what format they liked their answers to be in by default to help chat GPT have some amount of persistent knowledge of the user who was interacting with it. We also saw OpenAI join Anthropic, Google, and Microsoft to launch the Frontier Model Forum, which is a self-regulatory body responding to growing questions of regulation and policy and AI safety guardrails that were coming up in the public sphere more and more often. Over the summer, there was also the introduction of a new team at OpenAI called the Superalignment team. It was being led by Jan Lakey and OpenAI co-founder and chief scientist, Ilya,

Starting point is 00:10:11 Sutskever and set out the very ambitious goal of solving alignment with super-intelligent AI, or at least solving the core technical challenges of that alignment in the next four years. In addition to that lofty goal, they also announced that they would be dedicating 20% of the compute that they had secured to date to that effort. Now, as we would learn later, the super alignment team was in some ways reflective of broader divisions starting to happen within the company, but at the time was sort of just taken as another example of open AI's commitment to this sort of goal of a lot of In fact, I think I remember even Eliezer Yudkowski saying that his P-Doom had gone down just a little

Starting point is 00:10:46 bit after reading the announcement. Now, by the end of August, in the beginning of the fall, we were heading right back into feature mode. One of those came on August 28th when OpenAI introduced ChatchipT Enterprise. Chat Chachapet Enterprise was, of course, meant to address questions of security and privacy and data integrity and give big business users a version of Chat Chabit that they didn't have to be scared with leak proprietary information or otherwise mess up their privacy and security in any particular way. Throughout the year, we've seen the enterprise AI battle increase.

Starting point is 00:11:16 Amazon has been extremely active in that space. Of course, just this week, we got Amazon Q, which is a chatbot dedicated entirely to the enterprise use case. But so too has Microsoft, so too has Google. This is clearly an area where many people are focusing, and ChatchipT and OpenAI were no exception. On almost the same date of that Chatchapit Enterprise announcement, the information and others also reported that OpenAI had passed a $1 billion revenue run rate. Now, on the one hand, that was ahead of revenue projections that they had previously shared with investors, but at the same time, it still did not make them profitable. The cost of training, these models, the cost of compute, was still keeping them in the red, even with that significant milestone. At the beginning of September, the company really got people excited with the announcement of their first ever developer conference. OpenAI's Dev Day was slated to take place on November 6th, and pretty immediately speculation turned to what would be coming. Now, in his announcement of the event, Altman had made sure to say that there would be no GPT for, 4.5 or GPT5, so don't even get your head in that way. But people are still speculating on everything from better prices to longer context windows to potentially the beginning of AI agents.

Starting point is 00:12:22 Interestingly, however, even with that big day coming, Open AI didn't slow down, but in fact increased the speed of announcements. On September 25th, they announced what would come to be known as ChatGBTGPT vision. Basically, this was the first walk of ChatGBTBTBT into the realm of multimodal. With ChatGPT vision, users could take a picture and use that as input for chat GPT which opened up a whole new set of use cases. For example, imagine trying to fix a bike, pointing your camera phone at the troublesome area, and asking ChatGPT for help. The voice capabilities meant that people could interact with ChatGPT primarily by talking and listening. A couple weeks later, we got the latest version of the Dolly 3 Image Generation model, and importantly, it was

Starting point is 00:12:58 integrated directly into ChatGyPT. It had a bunch of differentiations over other competitors like Mid Journey, a notable one of which is that it could actually handle and recreate text. But for sure, the most significant thing about it was that it again added multimodal dimensionality to the chatypti product itself. Instead of having to learn a whole new lexicon of prompting, users could interact with Dolly 3 in natural language, and when the results came back and they weren't perfect, get specific in describing what needed to be changed and how. I know personally that shift in interface from having to learn these specific prompting norms of a tool like Mid Journey to just being able to interact in natural language has seen me shift a significant amount of my behavior

Starting point is 00:13:37 from Mid Journey to Dali 3, and I am a diehard mid-jury stand. Still, in many ways, the most significant and exciting announcement of the fall was, of course, the introduction of custom GPs at the OpenAI Dev Day. Now, there was a ton of other stuff that happened at that dev day as well, including the anticipated decreased prices and longer context windows and improved API performance and all that sort of stuff. But the big show was GPTs. GPTs represented OpenAI's very first steps into an agent future

Starting point is 00:14:05 and were basically ways for people to create versions of chat CBT that were refined and focused to a specific purpose. The core of it was effectively a setup prompt that was embodied and kept persistent in the GPT, but then users could also combine it with external actions to make it a more robust tool. When Altman and his team introduced GPTs at Dev Day, they made very clear that while they weren't AI agents, they were a very first fledgling step into that arena. Now, even if that had been the only thing that happened in November, it still would have been a big month for chat GPTBT, and yet it pales in comparison to what happened next. On Friday, November 17th, out of the blue at the very end of the day, although not before the end of the trading day, much to Microsoft

Starting point is 00:14:45 chagrin, the OpenAI board announced that they were firing Sam Altman because they had lost trust in him and that Greg Brockman was also being demoted from the board of directors. Now, I will not go into everything that happened in the subsequent five days. I've obviously done nearly endless content about that. But of course, ultimately Sam Altman would return. The board would get a new starting composition, and the company begins now its second year in a very different place than it might have thought it was going to just a couple weeks earlier. Altman now has the new task of rebuilding trust among investors, customers, the board, of figuring out how governance of Open AI is going to work going forward.

Starting point is 00:15:22 In the meanwhile, they have numerous contenders who are nipping at their heels, even if none of them have actually outperformed GPT4, and the policy and regulatory conversation is likely to do nothing but increase in significance in the year to come. Taking a step back, chatGBT is singularly, whether it is the dominant chatbot in the future, whether OpenAI makes the most important AI models of the future, whether it's actually the company to achieve AGI or someone else is, it will forever be the case that chat GPT was an inflection point moment for the world, a zero-to-one moment that divides the world crisply and cleanly into a before and an after. It's very rare for it to be so precise that something like that happens. But I think in this case it's fairly undeniable. And somehow, some way, it was just one year ago that it happened. I, for one, cannot wait to see what the next year, the second year of Chad GPT's life brings. You can bet I will be here to help you navigate through it all as it happens.

Starting point is 00:16:20 Thanks as always for listening or watching. And until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - ChatGPT Launched 1 Year Ago Today: A Retrospective

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.