The AI Daily Brief: Artificial Intelligence News and Analysis - AI Unlocks Ancient Greek Scrolls Destroyed by Mt Vesuvius

Episode Date: February 6, 2024

The Vesuvius Challenge is a $1m prize to use AI to decipher text seemingly lost on ancient Greek papyri in the eruption of Mt. Vesuvius that buried Pompei. NLW covers the amazing results. Today's Spo...nsors: Notion - Notion AI. Knowledge, answers, ideas. One click away. - https://notion.com/aibreakdown ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Breakdown, we are looking at the Vesuvius Challenge, a $1 million contest to use AI to read ancient scrolls thought lost time and volcanoes. Yesterday, a grand prize was announced. The AI Breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our Discord, her newsletter, and our YouTube channel. Hello, friends, a quick note here on today's episode. When I dug into it, I realized that this was going to be longer than our nirmed. main, and frankly, the brief news is pretty quiet today and can be lumped in with tomorrow's, unless you are super, super intensely excited to hear about meta adding labels to their image
Starting point is 00:00:42 generations, in which case I'm sorry that that's going to be a day late, but I think you will like this very unique and different type of story about the Vesuvius Challenge. Back to our normal format tomorrow. Welcome back to the AI breakdown. Most days on this show, we talk about artificial intelligence in terms of exciting new products, or perhaps big policy shifts, changes to the way we'll work, implications of technology and society, all of these highfalutin things, which are incredibly important, both on a personal and a societal level. But then every once in a while, we get a moment to just revel in awe of the human experiences that were totally impossible before that artificial intelligence can unlock. Today is one of those stories because we are talking about
Starting point is 00:01:29 the Vesuvius Challenge. Now, I have talked about the Vesuvius challenge before when some intermediate parts of the contest were won, but the TLDR on this is that for a couple hundred years, people have been interested in how they could read a set of papyrus scrolls that were found in a library in Herculaneum in a villa thought to be owned by Julius Caesar's father-in-law, and which potentially contain writings that have otherwise been completely lost to history. Now, Herculaneum was destroyed in the same eruption that destroyed Pompeii, but interestingly, when it comes to these scrolls, they were, on the one hand, preserved and protected, but on the other hand, melted into, honestly, what looks like tiny dried logs. In fact, when this villa was first discovered in these 1700s, some of the workers actually thought they were logs and burned them, before realizing what they were. The Vesuvius challenge was, in short, a $1 million contest to try to develop new AI techniques to actually read text. from these scrolls without having to physically unroll them. Today we're going to tell the story of how this came about and where the project stands after
Starting point is 00:02:32 yesterday's announcement that the grand prize of $700,000 has been awarded and that the team that won it did even better than the qualifications for that award. Now, I highly recommend reading the Bloomberg Business Week feature story on this by Ashley Vance and Ellen Huit. Ashley said it was one of the coolest stories that he had ever had a chance to work on and I completely agree. First of all, let's talk about what might be contained in these scrolls. As I mentioned, this villa was believed to be owned by a man that history knows as Pizzo who was Julius Caesar's father-in-law. As Bloomberg points out, a wealthy, educated man like Piso would have had the classics of the day,
Starting point is 00:03:06 along with more modern works of history, law, and philosophy. Quote, Piso is known to have corresponded often with the Roman statesman Cicero and the Apostle Paul had passed through the region a couple of decades before Vesuvius erupted. There could be writings tied to his visit that comment on Jesus and Christianity. Now, for those of you who don't spend a ton of time on this, modernity has lost the vast majority of writing that was created in ancient Greece. We have a tiny fraction of the dramas that were written, the philosophy. And much of what we do have was rescued from old moldering handwritten manuscripts in libraries around Europe in the Middle Ages at the dawn
Starting point is 00:03:41 of the Renaissance. So the idea that there could be this trove of potentially tens of thousands of scrolls, at least 800 of which we know about, is absolutely tantalizing for anyone who's interested in classics, or really Western civilization at all. Now, the villa at Herculaneum was rediscovered in the 18th century, and over the last couple hundred years, while the scrolls have generated a ton of interest, nobody has really had a lot of success in actually unlocking their secrets. Over the last 20 years, however, a computer science professor at the University of Kentucky named Brent Seals has been experimenting with and inventing new techniques using advanced medical imaging technology to unlock the mysteries of these types of scrolls. His first attempt to
Starting point is 00:04:21 handle a Herculaneum scroll came in 2009, but the software just wasn't even close to ready. In 2016, he and his students were able to decipher a charred ancient Hebrew text called the I'm Getty, but the technique they used wasn't applicable to the Herculaneum scrolls because they were written with a different type of ink. Enter Nat Friedman. Nat was the CEO of GitHub from 2018 to 2021 and is one of the more prolific AI investors currently. He had overseen GitHub's first experiments with AI-assisted coding, and so it was really familiar with the technology. During the pandemic, Friedman got really into Roman history and came across these scrolls. In 2022, he brought Seals out to California for a fundraising event, but none of his friends or peers
Starting point is 00:05:02 were interested. At that point, however, he suggested the idea of a contest to Seals instead. With an initial investment from Friedman, matched by his investing partner Daniel Gross, the Vesuvius Challenge was born and announced last year. Now, after it was announced, a number of additional successful tech entrepreneurs also pledged money to the effort, ultimately leading to the ability to offer a million dollars in prizes. And in some ways, a lot of this story is a story of how public momentum can change the destiny of things. For example, once they got wind of the contest, the Italian government offered up additional scrolls, which is something they hadn't previously done. So the way that the things kicked off, these two new scrolls offered by the Italians were scanned at a particle accelerator
Starting point is 00:05:41 in England, and those images became the basis for the contest. Now, to win the contest, contestants weren't required to decipher the whole scroll. That would have been far too much. Instead, they had until the end of 2023 to read four passages of at least 140 characters of continuous text. They were only eligible to win if the contest agreed to open source their methodology. There were a number of key inflection points along the way. One came from an Australian mathematician named Casey Hanmer. As Bloomberg writes, the early months were a slog of splotchy images. Then Casey Hanmer scored a point for humankind by beating computers to the first major breakthrough. Anmer took a few stabs at writing scroll reading code, but he soon concluded he might have better luck if he just stared at the images for a really long time.
Starting point is 00:06:24 Eventually, he began to notice what he and other contestants have come to call crackle, a faint pattern of cracks and lines on the page that resembles what you might see in the mud of a dried-out lake bed. To Hanmer's eyes, the crackle seemed to have the shape of Greek letters and the blobs and strokes that accompany handwritten ink. Now, when Hanmer published this crackle idea to the Vesuvius Challenge Discord, other contestants like Luke Ferreter, a 22-year-old college student who was a summer intern at SpaceX last year, started trying to find crackle in the other images. Now, the important thing here is that while a tiny percentage, just one to two percent, had this sort of recognizable crackle, contestants like Ferretter were able to add those letters to the AI models that they were developing in order to train
Starting point is 00:07:02 it to recognize more. Again, as Bloomberg writes, the model starts with something only a human can see, the crackle pattern, then learns to see ink we can't. A quick message before we get back to the episode today. At this point, you guys know that Notion is one of the major tools that I use day in and day out across the breakdown network, the AI Education Beta Project, basically anything that I'm doing in any sort of professional or entrepreneurial endeavor is going to be anchored by Notion. Now, you also know that one of the big themes that I keep talking about for 2024 when it comes to artificial intelligence is the integration of AI into our workflows. I think in many ways it's not just about which
Starting point is 00:07:41 third-party AI tool is best for any given use case, but how they actually fit into what we're doing in ways that are actually time-saving. And that's why I love that Notion now has AI so deeply integrated across its entire suite of tools, which means that it's everywhere in your entire workspace. Now, for those of you who don't know, Notion combines your notes, documents, and projects into one space that is simple and beautifully designed. It's your one place to connect teams, tools, and knowledge, so you can do your most meaningful work. Unlike other solutions, it doesn't have you bouncing between six different apps, it is seamlessly integrated, infinitely flexible, and incredibly easy to use. Now, with the new fully integrated notion AI,
Starting point is 00:08:18 you can work faster, write better, think bigger, and take care of tons of tasks that might normally take you minutes or hours in just seconds. One of my favorite use cases is to use Notion for brainstorming. So, for example, what would a great launch strategy be for some project? Use Notion AI to help you think through all of the different dimensions of how you could tell that story. Now, the proof is in the pudding, and Notion is used by over the first. half of Fortune 500 companies. And most importantly, probably for you guys, the teams that do use Notion, send less email, cancel more meetings, save time searching for work, and reduce spending on tools. Right now, you can try Notion for free when you go to notion.com slash AI breakdown.
Starting point is 00:08:54 That's all lowercase letters, notion.com slash AI breakdown to try the powerful, easy-to-use Notion AI today. And of course, when you use our link, you're supporting the show. One more time, that's notion.com slash AI breakdown. Now, Bloomberg also does a good job of describing how different a pursuit this is than, for example, training in LLM. They write, unlike today's large language AI models, which gobble up data, Luke Ferreter's model was able to get by with crumbs. For each 64 pixel by 64 pixel square of the image, it was merely asking, is there
Starting point is 00:09:25 ink here or not? And it helped that the output was known, Greek letters squared along the right angles of the crosshatched papyrus fibers. So in August of last year, when a new image that had lots of crackle was shared on the Vesuvius challenges Discord, Ferretter connected remotely to his dorm computer, put the image into the machine learning system he had been developing, and when he took out his phone again an hour later, he found three Greek letters. Pretty soon, he had found 10 letters, including what the classicists involved in the competition identified as the Greek word for purple. Now, this is the
Starting point is 00:09:54 point at which I first talked about the contest on this show, and the combination of the crackle discovery from Hanmer, plus finding the word purple by Ferretter, set off, as Bloomberg puts it, a new wave of enthusiasm among contestants, some of whom began to employ similar techniques. As this was happening, Ferretter teamed up with two other contestants' use of Nader and Julian Schillinger to better combine their efforts. Now, at the end of last year, the challenge received 18 entries for the Grand Prize. The winning entry from the team of Ferretter, Nader, and Schillinger was not just four sets of 140 character contiguous text, but 15 columns of text with entire paragraphs contained within them. While scholars are still translating the text right now, they believe.
Starting point is 00:10:34 believe it to be a work by Philodemus, who is an Epicurean philosopher who tended to focus on a very here and now and present sense of the world. For example, the scholars believe that this text is focused on the pleasures of music and food and their effects on the senses. Now, one interesting side note, given that Philodemus, the Epicurean, could be the portent of a total new unlocking of ancient history. Back in 1417, a book hunter from Italy named Pagio Bracolini had discovered a poem by Lucretius called on the nature of things. The discovery was the subject of a book called The Swerve, how the world became modern by Stephen Greenblatt.
Starting point is 00:11:13 And Greenblatt basically argues that the discovery of this poem by Lucretius, who was one of the best-known Epicureans, was a huge influence on the sensibility of the Renaissance. As the description of the swerve writes, it was a beautiful poem about the most dangerous ideas, that the universe functions without the aid of gods, that religious fear is damaging to human life, that pleasure and virtue are not opposites but intertwined.
Starting point is 00:11:34 and that matter is made up of very small particles in eternal motion, randomly colliding and swerving in new directions. Its return to circulation changed the course of history. The poem's vision would shape the thoughts of Galileo and Freud, Darwin, and Einstein, and in the hands of Thomas Jefferson, leave its trace on the Declaration of Independence. So for the history geek and me, the parallelism between that discovery, 606 years ago and this one, is just at least a little fascinating. Now, there are lots of challenges from here. There is a modern town built on top of the site where Herculaneum was. There are numerous properties and buildings that are atop this specific villa, and there's no guarantee that there are the tens of thousands of scrolls that
Starting point is 00:12:11 many scholars think there might be. Bloomberg concludes, barring a mass relocation, Friedman is working to refine what he's got. There's plenty left to do. The first contest yielded about 5% of one scroll. A new set of contestants, he said, might be able to reach 85%. He also wants to fund the creation of more automated systems that can speed the process of scanning and digital smoothing. He's now one of the few living souls who's roamed the villa tunnels, and he says he's also contemplating buying scanners that can be placed right at the villa and used in parallel to scan tons of scrolls per day, said Friedman, even if there's just one dialogue of Aristotle or a beautiful lost Homeric poem or a dispatch from a Roman general about this Jesus Christ guy who's roaming around,
Starting point is 00:12:46 all you need is one of those for the whole thing to be more than worth it. And so upon the announcement yesterday of the challenge winners, they also released a blog post called the Master Plan, Vesuvius Challenge Stage 2. The two big problems, they say, are segmentation at scale, and scanning at scale. Under segmentation, they write, the current bottleneck is tracing the papyrus surface inside the scan of the scroll. We call this segmentation. Currently, we use manual tracing aided by various algorithms. This is quite expensive about $100 per square centimeter. We spent about $200,000 so far to trace enough material to read the 15 partial columns of text that were revealed in
Starting point is 00:13:19 2023. Basically, that means that with current techniques, it could cost between $1 and $5 million to unroll an entire scroll, which would mean billions of dollars to unroll all 800 scrolls. So in stage two, they say, we are going to focus on solving auto segmentation. We believe it's possible to bring the cost of segmenting an entire scroll to $5,000 or below. Their 2024 target is to read 90% of scrolls 1-2-4 and offer a $100,000 grand prize for any team that achieves that milestone. The other challenge is the scanning. Right now, between transporting scrolls two at a time from Naples and custom 3D printed cases, to scanning them with a particle accelerator in England, the cost for each scroll to be scanned is $40,000. That means it would cost $30 million to scan all $800.
Starting point is 00:14:00 This is what Bloomberg was talking about when they said that the plan is to try to see if they can actually scan on site. All in all, their goal for the end of stage two to take place in 2024 is to have read at least one entire scroll. Once they've got that technique, stage three is to scan all 800 scrolls, which they hope can be done in two to three years. And stage four is to create enough momentum that there is the political will to excavate the rest of the villa. And so that is the story of the Vesuvius challenge from here. Like I said, a pretty unique moment and a totally different type of use of AI that shows to me just how fundamentally human this technology can be. Thanks for listening or watching as always. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.