Everyday AI Podcast – An AI and ChatGPT Podcast - EP 251: Are LLMs Actually Connected To The Internet?

Episode Date: April 16, 2024

Ready for a little LLM secret? Big tech companies are lying to us. They tell us their models are connected to the internet in real time. Are they really? Not really. We're doing an LLM breakdown,... as we compare the internet connectivity of popular large language models like ChatGPT, Gemini, and Copilot. Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Ask Jordan questions on LLMsRelated Episodes:Ep 246: No that’s not how ChatGPT works. A guide on who to trust around LLMsEp 204: Google Gemini Advanced – 7 things you need to knowUpcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTimestamps:02:00 LLMs we are testing05:33 RAG importance in AI and internet connection.06:50 Browsing with Bing queries words in URL.13:04 Comparing AI accuracy15:40 Rating and evaluating ChatGPT and Copilot performance.19:34 Acknowledgement of Google's helpful search feature.26:14 Uploading files to ChatGPT.29:21 Unofficial language model results from Internet connection.31:14 Large language models being integrated everywhere.Topics Covered in This Episode:1. Testing Large Language Models2. Using Large Language Models in Business3. In-Depth Comparison of Different Language Models4. Implications for Future UseKeywords:ChatGPT, Google Gemini, Microsoft Copilot, large language models, internet information retrieval, web reading platforms, information interpretation, accuracy, business use, safeguards against inaccuracies, up-to-date information, real-time data, live testing, language model performance, Everyday AI, Jordan Wilson, market cap, largest US companies, podcast episodes description, language models comparison, browse with Bing feature, web reader GPT feature, outdated information, information veracity, data retrieval, PDF hidden information, artificial intelligence, integrated AI, internet connectivity, knowledge workers.Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live and Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. Are large language models actually connected to the internet?
Starting point is 00:00:50 I mean, the big tech companies, Microsoft and OpenAI and Google, they tell us their models are connected to the internet and they can bring us information in real time. And that is so important when it comes to using large language models to grow your company and to grow your career. So if you've ever thought about that, or maybe you haven't, today. show is for you. What's going on, y'all? My name is Jordan Wilson. I'm the host of Everyday AI. We're a daily live stream podcast and free daily newsletter helping everyday people like you and me, learn and leverage generative AI to grow our companies and to grow our careers. So if you're joining us live,
Starting point is 00:01:30 thank you. This is technically pre-recorded, but debuting it live. I might technically be on like a roller coaster right now or something like that. We'll see. But, you know, make sure to check out your show notes. As always, if you came here for the daily news, we're still going to have that in the newsletter. Don't worry. So go to your everyday AI.com and sign up for that free daily newsletter. And if you are joining us on the live stream here, thank you as well. I'll be in the comments later. Don't worry. So let me know your thoughts. Have you had good results using these, quote unquote, internet-connected large language models in getting up-to-date information? I'm curious. But we're going to do it all live, y'all.
Starting point is 00:02:11 So this is going to be a very unofficial show. And I'm letting you know, this is one of those ones if you are on the podcast. This is one of those ones you might want to check the show notes and come back and watch. Right. So I'm going to be doing the exact same, very simple prompts inside chat, GBT, using the default browse with Bing. We're going to be doing it in chat, GPT, using a web reader, GPT, I'll explain why here in a second.
Starting point is 00:02:43 Then we're going to be doing inside Google, Gemini, and inside Microsoft co-pilot. Okay, so a couple reasons why we're doing this test. Well, number one, so many of us now are using large language models in our day-to-day lives. And we should be, right? If you're not already using a large language model, you should be, right? Whether it's your company, whether you are leading a department, whatever it is, if you are a knowledge worker, if you want to not just stay up to date, if you want to not get left behind, you need to be using large language models.
Starting point is 00:03:19 If you are a knowledge worker working in front of a computer like I am right now, right, like you probably are, whether you're reading, researching, doing competitor analysis, you know, putting together a sales presentation, whatever it is, you probably should be using large language models. But the danger is, or maybe one of the reasons, that your company hasn't fully dove into the generative AI waters is because you're worried about hallucinations. You're worried about something being wrong. And one of the best ways, aside from, you know, maybe taking our free PPP course, the prime prompt polish course, but one of the best ways
Starting point is 00:03:55 to get better outputs from a large language model is to make sure you're connected to the internet in real time, right? And that's what these three models are. You know, the big companies tell us they are. So let's quickly explain why we're comparing these models and not other ones, and we're doing two versions of chat GBT. Okay, so chat GPT has Brows with Bing, which is due to their partnership with Microsoft, they're able to leverage that technology. So when the model determines that it needs to query the Internet, it will. So you can run like y'all, like we have to keep in mind.
Starting point is 00:04:31 Large language models are generative, right? That means you can put in the same prompt 100 times, get 100 different results. You could put in the same prompt 100 times and get kind of 50 variations of two results. It's generative. It's not deterministic. So, you know, think of that even when you are querying something inside chat GPT.
Starting point is 00:04:48 And if you are using the paid version of chat GPT plus, because that is the only way that you can use the browse with Bing feature, it determines on its own whether it should query the web to bring you more up-to-date information. So OpenAI just released an up-to-date information. knowledge cutoff. That's important as well. So currently, as of the time of this recording, it is still a knowledge cutoff of April 2023, which is more than a year old.
Starting point is 00:05:14 However, Open AI did release that any day, minute, hour, second now, that date should be updated to December, 2023. So a much more recent. So think, let's say you are in logistics in the automotive industry, let's just say, right? A lot changes in a year. Think of everything that's changed in your industry in the last year, right? And if you're using chat GPT as an example to help, you know, outline a blog post, you're helping it to put together information for a partnership proposal. You don't want to be working with a knowledge cutoff for more than a year ago, right? Even worse, if you're working with a free version, it's January 2020, right?
Starting point is 00:05:54 So that's 26 months ago. So being able to connect with a large language model to the internet to retrieve information is extremely important, right? We always talk all the time about rag, right? So, you know, and how important rag is in AI, you know, being the retrieval augmented generation, but essentially being able to reach outside data. So you can almost think of having an internet connected large language model as like mini rag, right? It's not traditional rag, but it's almost like you are, you know, and if you're doing it correctly and directing the model to the right sources, you know, it's almost like you're bringing your own data in in a way.
Starting point is 00:06:33 But then it gets to thinking, like, are these models actually connected to the internet? Are these companies maybe not lying to us, but are they just giving us shiny marketing? And we assume that these outputs that we get are sound and true? Or are they not really connected to the internet at all or not maybe connected in the way that we think? So that is what today's very unofficial, unscientific rundown is all about. So we have chat GPT with Browse with Bing.
Starting point is 00:07:01 Okay. Then we have chat GPT with a web reader GPT. All right. And the difference is, and people don't know this, and we're going to see this live, and we'll see how all these systems do. Browse with Bing cannot go to a specific URL. Sometimes it will find it. So in our testing, what we've seen is you can put in a URL
Starting point is 00:07:25 and browse with Bing, and you might think that it goes there, but all that really does is it queries the words in the In most cases, due to SEO and due to what URL you put in, you might get the correct web page. In some instances, you might not, or you might get a cached version or an older version, right? So that's why we're going to be doing all of these things live. So with the GPT, a web reader, GPT, and if you're new, obviously you need the paid version to use these GPTs. GPTs are essentially a customized version of chat GPT. But with these, you can visit a specific page, right?
Starting point is 00:07:59 So you can say, hey, go to this exact URL. and you're not leaving it up to chance to browse with Bing, that it's going to go to the correct one. All right. So we have chat GPT, the default mode, browse with Bing, chat GPT with the GPT reader, Google Gemini. We're using their enterprise, so we're on the paid plan as well. And they did just have a big update recently.
Starting point is 00:08:20 So we're going to see if we're seeing that, or is that just in their playground? We'll see. Then we're using Microsoft Copilot. So Copilot is the only one that we're using the free version. the other ones, we have paid versions, obviously. It's also worth noting that Microsoft co-pilot runs off the GPT technology, whereas Google Gemini runs off, you know, Gemini Pro, Gemini Ultra.
Starting point is 00:08:40 All right. So that is the precursor. And you know what? Now we're just going to jump straight into it. So we just have a couple, you know, easy examples, right? So let's go ahead and get this thing started. And again, this is very, very. unofficial. So let's not take this too seriously, right? We're like, we're not going to be publishing a
Starting point is 00:09:04 paper on this anytime soon. So let's go ahead. We're going to give some, some basic, some very basic examples. And we're going to see how it does in all of these examples. All right. So give me a second here. We're just getting all of our tabs in order because we're going to be doing this live. We're going to be going through it quickly. And like I said, if you are, very interested in this, and it is important to understand kind of this, this, this importance of of being connected to a large language model. So if you are on the podcast, this might be one that you should come back to later and watch on the live stream. All right, without further or two, let's get it going. And we have a very unofficial, kind of ugly looking scorecard. No worries about
Starting point is 00:09:53 that, right? All right. So we're just going to start with something basic. We're going to tell all these models to say, please tell me about everyday AI. There's no right or wrong answer here. So it's going to be subjective, right? Well, actually, no, we're going to do, we're going to say about the podcast called. Okay. So here we should get this. So I'm saying, please tell me about the podcast called Everyday AI. So I'm using this in the default browse with Bing. All right, we got it right. It's saying it's daily live stream podcast and newsletter. All right. Good job.
Starting point is 00:10:33 We got it right. So now we're using, we're jumping in the chat GPT. Again, we're on the paid version. And we are using the web reader. So we're doing the exact same thing. So I'm saying, please tell me about podcast, about the podcast called Everyday AI. All right. So again, we're just doing the same thing in each one.
Starting point is 00:10:51 And we're going to give each one a score. You know, did it get green, did it get red, or did it get a shrug? All right. So web reader, got it right. So web reader queried the internet, and it got it correct. That's great. So another thing to keep in mind, so right here, the default browse with Bing, they just had an update about two and a half weeks ago.
Starting point is 00:11:10 We covered it the day it came out. They are starting to source and cite things a lot more. So again, we always want to worry about, is this accurate? Is this correct? The new update from OpenAI, Brows with Bing. We're getting multiple citations. So in this case, it's pulling in the tune in and it's pulling in the Apple. The web reader, GPT, kind of did the same thing.
Starting point is 00:11:31 It just left us a link. So it actually just looked at our website. So that's pretty good there. So now let's go into Google Gemini. Same thing, Google Gemini. Please tell me about the podcast called Everyday AI. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience.
Starting point is 00:11:58 Meet Firefly AI assistant. Now live in the Adobe Firefly app, the all in one creative AI student. Powered by Adobe's creative agent, Firefly AI assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the assistant. The assistant orchestrates multi-step workflows, drawing on 60 plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life.
Starting point is 00:12:28 You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adopi.com. All right.
Starting point is 00:13:02 So let's see. All right. So we got it right. It sounded a little generic at first, but, you know, know, then it actually got to, yes, this is, you know, Jordan Wilson, former journalists, blah, blah, blah. So, okay, it got the right information. So so far, everyone's got it right. And now we're jumping into co-pilot from Microsoft.
Starting point is 00:13:23 So we're running the exact same thing here. I'm going to zoom in for our live stream audience. Hopefully, you all can see this. We're at like 150 Zoom. All right, so perfect. Here we go, Microsoft. And Microsoft, co-pilot, I kind of like this. So, you know, it said, yes, certain.
Starting point is 00:13:40 Every Day AI is a valuable resource for anyone interested in understanding and utilizing artificial intelligence in their daily lives. It says here are the key details. So I kind of like this co-pilot one because I didn't even ask for bullet points, but I love getting information in bullet points properly formatted. So I personally like the responses here from Microsoft co-pilot. It even kind of scraped these testimonials that are on our homepage, right? At first, I'm like, are these hallucinated? But I'm like, nope, I know these people. And, you know, Oliver Hens here.
Starting point is 00:14:21 So I'm opening up. We have a couple of these testimonials here on our website. And I'm like, yeah, I know these are correct. These aren't made up because I know that one from Oliver there that says it's a felony that you give this knowledge away free of charge. All right, so all of our models on this very, very easy, very simple test passed. So I'm just marking them up here in our little chart because at the end, we're going to see in our very unscientific test, which one did it correct.
Starting point is 00:14:57 So now I'm going to say, I'm going to ask a more specific question about everyday AI. Okay? And the reason I'm doing this, y'all, is just because, you know, If you're listening to the show, you know, you probably know a little bit about. So I'm going to say this next prompt, and again, this isn't how I normally prompt. This is just very basic stuff. I'm saying, please tell me about the daily podcast called Everyday AI by Jordan Wilson. No, actually what I'm going to say is please tell me how many episodes, how many episodes
Starting point is 00:15:28 the Everyday AI podcast has done. All right. So here's the thing. I'm doing it in the same window. So I'm not clearing its memory, so it still understands what we're talking about here. All right. So now I'm going to the default mode browser Bing. Let's see how it does.
Starting point is 00:15:51 All right. So it says 244 episodes. Y'all, if I'm being honest, I lost track. Are we at 244? Let's see if anyone got it right. So we're at 248. All right. So the default.
Starting point is 00:16:05 though did not get it right. All right. Let's see if the web reader one gets it right. So same thing. I'm using chat GPT with the web reader GPT asking for however many. It got 246. So got a little closer, but still not right. All right. So now I'm jumping into Google Gemini. All right. So let's see. Google Gemini says it's a bit tricky to give you an exact episode count for the everyday AI podcast. Okay, so it didn't even take a guess. It just said, oh, there's been a milestone recently, 200. So I'd say there's a failure there from Gemini. You know, Google, like, if you're connected to Google, the search engine, you should be able to see because if you query even the episodes page without crawling it, you will see that, you know, I know the page
Starting point is 00:16:58 got crawled today. So it would be there. All right. So now we're going into co-pilot. Same thing. please tell me how many episodes the Everyday AI podcast is done. So it says 246. All right. So here's what we're going to do. Here's what we're going to do. We're going to give a green checkmark for the two that got it closest, okay, which was the GPT version, Microsoft co-pilot, Google Gemini didn't even try.
Starting point is 00:17:22 And we're going to give the shrug version to chat GPT. So kind of it was in the middle. The default chat GPT was two episodes or four episodes off. the GPT and the co-pilot were closest. So I guess, again, this is all arbitrary. So I'm going to give them green checkmarks. And then I'm going to give Google Gemini. Or maybe, you know what, let's be more accurate here.
Starting point is 00:17:47 So we'll give, we'll give co-pilot a shrug. We'll give the GPT version a shrug. And then we'll give the other two, since they got it. the most wrong. You know what? I should have just done like a numbering version, but that's fine. All right. So let's get going.
Starting point is 00:18:10 So now we're going to do market cap. All right. So what that is is I want to see today the biggest companies, biggest companies by market cap. All right. So I'm going to go ahead and bring up here on my screen so you all can see. So the biggest companies by Mike by market cap right now, are Microsoft number one at 3.1 trillion. Apple number two, 2.7 trillion,
Starting point is 00:18:38 Nvidia number three, 2.2 trillion. In Alphabet, Google, 1.9, and Amazon, 1.96 as well. So they're pretty close to each other. So right now, I'm going to ask all of these, please tell me the largest five companies in the United States by market cap. All right. I'm going to guess most of them are going to get it fairly wrong. So I'm going to say, please tell me the five largest companies in the United States by market cap. All right. So let's go ahead. We're going to grab that prompt, jump into chat GPT default.
Starting point is 00:19:14 All right. So let's see how chat GPT default does. So we have, and I'm going to, I need to open this here in a new tab so I can keep tabs on who's getting it right and who's getting it wrong. All right. So we have. Pretty bad here. All right.
Starting point is 00:19:33 So chat, GBT, browse with Bing, got the first company wrong. It put Apple instead of Microsoft, and it is Microsoft instead of Apple. Then it has alphabet number three. So it's all wrong,
Starting point is 00:19:44 not even close. Not even close, Chad GBT. And also, so it brought up a random article from Forbes India. Let's see when that article was. I'm clicking it March 19th.
Starting point is 00:19:57 Ooh, brutal. Brutal. All right. So again, think of how you're using large language models. even if you're connected to the internet, look at how disastrous that could have been, right?
Starting point is 00:20:06 If you're using information blindly, not knowing how these models work, not knowing how they retrieve data in information, and if you just took that as the truth, that could have been bad. All right, so next, what we're doing is we're doing the chat GPT inside the web reader. Let's see if we can get any closer.
Starting point is 00:20:23 All right, so the default mode didn't really get anything, right? So now it's just saying as of early 2024. So it didn't even give us a date. However, we have Microsoft, Apple, Nvidia, Amazon, Alphabet. So we got three out of five right. So better, better.
Starting point is 00:20:43 All right. So a web reader did a little better. So we got so far of a fail. Let's just go ahead and do this as we go along. So we got a fail. We got a shrug. Now let's go ahead and do Gemini here. So let's ask.
Starting point is 00:21:00 please tell me the five largest companies in the United States by Market Cap. Let's see what we get. All right. So Google, all right. Let's see here. So Google, Microsoft, Apple, Invidia, Alphabet, Amazon. God it right. Look at that.
Starting point is 00:21:20 All right. And then it also gave us a link to find the up-to-date information. And sure enough, it is using the exact same source that I have. to pick, which was the top source on Google. So once, hey, good job, Google, Gemini, you went to Google, which you've been struggling to do for the last three months. So good job there from Google. And let's go to Microsoft Copilot and see what we get.
Starting point is 00:21:48 Again, Copilot in this test is the only one that we're using the free version. All right. So let's see. We have Microsoft, Apple, Invidia, Alphabet, and Amazon. All right. So that got it right as well. So I'm wondering, okay, so Gemini did not give, even though it got the correct order, it didn't give prices, even though in theory we didn't ask for prices. However, co-pilot did.
Starting point is 00:22:17 And I'm looking at these numbers here. And they're pretty close. They're pretty close. So I'm still going to give them both. They passed, even though I would say co-pilot passed a little better. All right. So let's get this thing going. So now we just have two more tests.
Starting point is 00:22:35 So now we're asking it to visit a specific website and we're asking for specific information. All right. So we're starting at the beginning now. We are going to chat GPT, the default mode. And I'm saying, what are the most recent episodes from this page? And I'm giving it the page, the URL for your everyday AI.com slash episodes. And if you didn't know, you can go, there's hundreds of episodes. I don't know if you know this.
Starting point is 00:23:03 You can sort them by category or you can click this AI tracks button right here, which is pretty cool. And let's say you just want to learn about sales. You can click sales and read all those. All right. So anyways, the most up-to-date ones, and again, this is a recording. So it's April 11th. So we have episode 248 about free chat dbt versus chat tpt plus.
Starting point is 00:23:27 We had the episode from yesterday with She Tall, Rishi, about AI, your new BFF or Tech Terror. So let's see how ChatGBTD did. Okay. Interesting. So it got this correct. So it looks like it queried the correct page. So I'm hovering over the source, and it did. So it looked at the words in the URL.
Starting point is 00:23:51 It queried those in Bing. And by doing that, it did bring up the correct page, which doesn't always happen. right. That's obviously a niche your everyday AI in episodes. If you Google that, our website's going to come up first because it's very niche. If it was a more general, you know, terms in the URL, it wouldn't come up like that. All right. So let's keep moving on. We're going to the web reader one and we're asking the same thing. What are the most recent episodes from this page? I'm guessing this one's going to get it right. But who knows? This is why we do it live, unedited, unscripted. Let's see what we got here. All right. So,
Starting point is 00:24:28 This got it right as well. It's giving us the correct episode, starting with 248. Let's go to Gemini. Hey, Google. Are you connected to the internet? Let's find out. Asking Gemini. And here we go again, Gemini.
Starting point is 00:24:42 Unfortunately, I can't directly determine the most recent episodes for that page for a variety of reasons. Dynamic content, websites, right? So it's giving me these reasons that it says it can't visit the page, right? But you know what? If I go to Google, let's just go ahead and do this for fun. All right. So I'm going to go to Google.com. And I'm going to, even though this is not how you use a search engine, I'm just
Starting point is 00:25:09 literally going to ask the same thing. All right. Let's just go. Let's just query just this URL. All right. So it's not bringing it up in the, in the meta description. It's not bringing up all of our different episodes. But still, I mean, come on, Google.
Starting point is 00:25:26 The fact that you say that you're internet connected. I'm on the enterprise plan, which is like $30 a month. And for some reason, Google can't see web pages, even though I know Googlebot is on my website hundreds of times a day, right, because I'm looking at it. All right. So failure there. And let's go to Microsoft co-pilot. See how it does. And we have one quick test after that.
Starting point is 00:25:55 All right. So co-pilot. Oh, let's see. Didn't miss one. It did. So co-pilot started with 245. So we're going to get that the halfway. All right.
Starting point is 00:26:04 So our two chat GPTs, the default got it right. The GPT got it correct. Google Gemini, a big fail. And then Microsoft copilot close. So we give it the shrug. We give it the shrug. All right. So last but not least, we're doing a little PDF test.
Starting point is 00:26:23 All right. So we can try this in different ways. So this is, we're doing a little needle on the haystack test, right? We've been doing these for like a year here on the show if you ever watch our YouTube channel. But we have a pretty short PDF here. It's 41 pages. On page 23, I just hid some information. I said, my name is Jordan.
Starting point is 00:26:42 My favorite food is deep dish pizza. My favorite color is Carolina Blue. I'm from Chicago. So we're going to ask chat GPT. So in the default mode, no, you know what? We're just going to do it the same way here. So I'm going to say, I'm going to say, read this page and tell me what Jordan's favorite food is.
Starting point is 00:27:09 All right. So I'm going to go ahead and get these started on Gemini and on co-pilot as well. Oh, wait. I just did a, did a little mistake there. I left the wrong URL. So it's not going to be able to access the, it's not going to be able to access that one. Let's give it the correct URL here and try this one more time.
Starting point is 00:27:37 All right. So we are going in the default Brows with Bing mode. There we go. We're going in the Gemini. And we're going in the copilot. So the default browse with Bing says I can't directly access web pages or files from external links. Obviously, I could download this and ask the same, or sorry, upload the file into the default mode of chat GBT because it's multimodal. I can, you know, upload photos, documents,
Starting point is 00:28:10 etc. But that's not what we're doing. We're just seeing, hey, can I go visit the internet? All right. So we got a failure there. So now we're going into chat GBT with the web reader. All right. And I'm going to say read this page. Same thing here. We'll see how web reader does. And then so let's check on Google. Google. Unfortunately, right. Google, Google's like a kid that got in trouble. It's like, see, see what happened was, I was supposed to be connected to the internet, but then I'm not. So here's 18 reasons why I still can't access the internet, even though I am Google, right? So, so not that great from Google. Let's see Microsoft co-pilot. Interesting. So instead, it said it's looking at the country of Jordan, and it's giving me Jordanian food,
Starting point is 00:28:58 a flavorful journey. Although, hey, so, hey, this falafel here, uh, looks pretty good. All right. So we have failure from default. So let's just go ahead and do this. So we got big failure from default. We got failure from Google Gemini. And we got failure, the first failure from Microsoft co-pilot. So let's check in on the GPT version. I need to give it access to read this PDF. So let's see how this does. Bam, it got it. It found the needle in the haystack. Look at that. So in a 40-page PDF. I hid some information on page 23, the web reader, GPT, found that information, got it right. Look at that.
Starting point is 00:29:44 All right. So that concludes our very unofficial study on our large language models connected to the internet or not. So our unofficial standings here. So the default browse with Bing got two out of five. All right. we're going to give the shrug, the yellow shrug guy. We're going to give that a half point. All right.
Starting point is 00:30:10 So chat, GPT. Let's go ahead and just write the scores down here as well. So we can make this super official, right, y'all? So let's go ahead and do this. We'll give it a big. So what did we get here? We got a two. Let's go ahead and do this so we can all see it, huh?
Starting point is 00:30:27 Love Canva, by the way. We had the head of AI from Canva on the show. a couple months ago. All right. So we have a score of two for Brows with Bing. All right. So here we have four. So chat GBT with a GPT is four.
Starting point is 00:30:49 So the default mode inside chat GPD didn't do well. The GPT did better. All right. We got a Google Gemini got a two. And then last but not least, let's see what we got here. We got a three. All right. So here's our final, very unofficial, not scientific at all, results from our large language
Starting point is 00:31:15 models actually connected to the internet. Well, not really. Some of them do a little better than others. Also worth noting out, like someone, you might be asking like, oh, why didn't you do perplexity? Well, perplexity is an answer engine, and it uses either GPD4 or it uses quad. And you might be wondering, why didn't you use? Anthropics Cloud.
Starting point is 00:31:34 Well, it's not connected to the internet at all, and it doesn't pretend to be. All of these other companies, ChatGBT, GBT, Google Gemini and Microsoft co-pilot, tell you in their marketing, hey, you know, you're going to get better answers with us because, you know, we're connected to the internet. So our final scores, the default mode of chat GBT,
Starting point is 00:31:50 two out of five. All right. And tied, so chat GPT and Google Gemini are tied in last place because Google Gemini also two out of five. All right. our runner up is Microsoft co-pilot with a three out of five. And in first place, chat GPT using a web reader, GPT, with a four out of five. So I could go on for this all day.
Starting point is 00:32:18 I won't. We're going to leave it there. Let me know, was this helpful? Do you understand now the importance of having up-to-date information? Because think, in the coming weeks, months, quarters, and years, whether you know it or not, you're going to be using large language models more and more, right? You might be prompting a couple times a week, a couple times a day now. But generative AI is coming to where we are, right?
Starting point is 00:32:49 Every single day, we're getting more and more announcements. A year or so ago, you kind of had to go out of your way, right? But now it's coming to our desktops, right, with Microsoft 365 co-pilot. It's in almost every, right? Large language models are being integrated into everywhere we work. If you work in front of a computer, more and more large language models and LLM integrations and desktop operating systems with models baked in, whatever, Apple is going to be releasing here pretty soon. Generative AI is coming to every inch nook and cranny of where you do business. So you have to understand, right?
Starting point is 00:33:28 You have to, like one of the biggest reasons that companies either don't use large language models or they're slow and they're getting passed up because of it is because they don't understand how they work. And they don't understand the right ways to put in safeguards, to put in fact checking, to know when you get hallucinations and when you might not. And know how to give chat GPT and Gemini and co-pilot access to more up-to-date information to decrease the likelihood of hallucinations. and putting out embarrassingly wrong information. So I hope you learned a little bit today. If you did, please consider sharing this with your network. I'd really appreciate it. And I'd really appreciate it.
Starting point is 00:34:06 If you also would go to your everyday AI.com, sign up for that free daily newsletter. Thanks for tuning in. Hope to see you back tomorrow and every day for more everyday AI. Thanks, y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest,
Starting point is 00:34:34 orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating.
Starting point is 00:35:08 It helps keep us going. For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.