Advent of Computing - Episode 137 - Edge Notched LIVE

Episode Date: August 18, 2024

LIVE from VCF West 2024, my talk on edge notched cards! Since this is a live recording from an auditorium the audio is a little boomy, so be warned. Actually, I'm pretty sure this is the same space th...at CHM uses for some of their oral histories. What I have today is just the audio component. VCF will be posting a full video eventually, which I'll be sure to pass around.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome back to Advent of Computing. My name is Sean Haas, and I am breaking my usual format. This episode I'm calling, I think, Edge Notched Live. It's a recording of my presentation from VCF West a few weeks ago. Now, if you've been listening to the show for a long time, this is kind of going to be a rehash of some things I've been covering for quite a while on my quest to understand early data management. However, you may also enjoy this episode. It's a consolidation of a lot of the work I've been doing towards Understanding Edge Notched Cards. It puts a lot of bits and pieces I've been pulling together into one place and makes a little nice presentable package. At least, that's what I was going for.
Starting point is 00:00:45 In that sense, it's kind of similar to the live episode that I recorded when I was down at VCF SoCal at the beginning of this year. Now, before I roll the tape, I do have a couple announcements and kind of state of the pod stuff. The state of the pod is, as usual, really well. VCF West was fantastic this year. I'm really glad I decided to go and got the chance to speak. I keep having the humbling experience of meeting listeners at these in-person events. So if you made it out, I'm so glad I got to meet you. Thank you so much. Y'all are really why I keep doing the show.
Starting point is 00:01:20 Thank you so much. Y'all are really why I keep doing the show. Also, since I have the audio recorded from VCF, I decided to take a bit of a break for the last two weeks. I haven't been working on production, which has been very nice. I've actually been able to do some retro computing stuff. I've been working on restoring some of my old systems. But that said, I am jumping back into my normal two week production schedule as of well, as of today, as I'm recording this. So there's going to be just a normal,
Starting point is 00:01:54 plain, not bite sized, not live episode coming out in two weeks, then I'm just back to the usual grind. But it has been nice to take a little bit of a break. That segues nicely into my other announcement, which is I'm going to be taking an honest-to-goodness real-life vacation in October this year, which I wasn't thinking when it was initially planned, but October traditionally is my only event month on the podcast. It's spook month. It's all Halloween-themed episodes. Perhaps you see the issue here.
Starting point is 00:02:29 I'm going to actually be out of the country on a vacation for half of October this year. I wasn't really thinking ahead. In classic Sean style, I don't plan things out very well. My big brain plan that I've come up with, which I'm going to stick to since I'm saying it out loud now, is I'm going to kind of change up my format for this spook month. Instead of doing two large episodes, I'm going to do four small episodes. Those are easier for me to produce and kind of slide in as space opens up in my normal life schedule. So this year, we're going to have four small episodes,
Starting point is 00:03:09 each on something just a little bit spooky. So it'll be classic fare, just spread out. I think that'll work with my travels and work and my normal life and production. So between now and then, if anyone has ideas for spooky episodes, I have some I'm working on already, but I'm very open to anything high strangeness or spooky or annoying around computer history. Which takes me to my final announcement, which is something very annoying. If you're a patron or use Patreon, you may have seen the news about Apple and Patreon. It's not very good. Essentially, what's going on is Apple is forcing Patreon to charge 30% extra on the iOS app.
Starting point is 00:03:57 So that means if you're a new member and you join any Patreon using iOS, you're charged an extra 30%. And they're also changing how some of the scheduling around billing works. It's all annoying and kind of confusing. So my recommendation is if you're ever considering being a patron, do not do it through the iOS app. Just do it online. That doesn't charge you an extra 30% for no real reason. So if you feel the need to join up on my Patreon, get some bonus episodes, get some early releases, do it using the website. Don't do it on iOS, please.
Starting point is 00:04:40 Now, with all of that said, I'm going to start rolling the tape. This is my talk from VCF West 2024. My name is Sean Haas. As was just announced, I'm a freelance historian. I normally host and produce the Advent of Computing podcast, where I cover the obscure and less well-documented history of computing. The reason that I tend to veer towards that is because there are large swaths of computing history that are very well covered, like the history of Apple and Microsoft. Everyone knows that. There's dozens of books.
Starting point is 00:05:20 But there's these huge amounts of the historical timeline that just aren't preserved or are sitting in archives and haven't really been looked at yet. And we're just now hitting a far enough point away from those events that we're starting to get the context and enough resources that we can actually do real history about the computing revolution. So today I'm going to be sharing with you one of those stories that I've been digging into for quite a few years now. It's the story of pre-digital hypertext and how hypertext initially was developed. Just to give the broadest of outlines of the discussion today, we're going to be going from these weird-looking card stock things that look kind of like punch cards but not really and working our way up to the first automated digital links on a computer. That means that we are talking early hypertext.
Starting point is 00:06:18 We're not going to be talking about the Internet at all today. Once again, the reason for this is that transition from early hypertext systems to the modern web, it's really well documented. That was in living memory for many people. But this earlier period, it's older, it's much less well documented, and it hasn't really been treated academically yet. So my research is primarily looking at these periods that haven't been given proper treatment. But as I said, it's early hypertext,
Starting point is 00:06:49 and that's very different than the modern web. So I want to start off by going over what some of the key features of early hypertext are just so we're in the right mindset, so we're not thinking like, where's the YouTube on this? Why can't I like these cardstock slips? Where's the YouTube on this?
Starting point is 00:07:04 Why can't I like these cardstock slips? The crucial idea that we need to understand when talking about early hypertext is the information problem. This was coined by Vannevar Bush in this paper called As We May Think in 1945, but it's something that existed by different names, I think, ever since humans had the first libraries. So there's probably someone over in Alexandria going,
Starting point is 00:07:28 oh, gee, I got an issue with this stuff. The most succinct way to describe the information problem is that too much information is the same as not having any information. We reach a point as data piles up, as sources get shelved, where a single person, unaided, can't manage all that information. You can't remember where everything's stored. The problem that leads to isn't just you can't find your favorite book, it's that progress grinds to a halt. There's an old adage that we stand on the shoulders of giants, but if you can't find what those giants were writing, you can't really get on top of their
Starting point is 00:08:05 shoulders. So that's what Vannevar Bush started seeing in the 1940s. He was the director of the Manhattan Project, and he noticed that a lot of his researchers were spending more time doing background research and struggling with that aspect of their work as opposed to actually progressing the field. Specifically, Bush envisioned hypertext as a possible solution to this, as a way to better manage information so people will be able to deal with this larger and larger pileup of data. To get into the specifics, I want to look at Vannevar Bush's solution, which comes out of that 1945 paper, and that was this theoretical machine called the MIMEX. Now, I'll be going over three core features to early hypertext.
Starting point is 00:08:51 These features are present in most early hypertext implementations. I just chose the MIMEX because it's one thing, it's pretty self-contained, and it's pretty well documented. So this gives us an easy jumping off point without getting into my wild speculation immediately. The general idea of the Mimex is that it's this desk that automatically stores books. It's envisioned as this thing
Starting point is 00:09:14 that would store an entire library just in a normal size desk, very unassuming. What's cool about it and what makes it hypertext is that instead of just sequentially looking through data, you'd be able to pull up a page of a book and link that to another page, or pull up an idea that you've loaded into your desktop library and link that to another idea. By doing that, you can construct these trails of thought where you'd be able to jump from one idea to the next to the next. And the idea is you're connecting data as associations
Starting point is 00:09:50 instead of in meaningless sequences. The first hallmark of all early hypertech systems is they adopt new technology. The Mimics in 1945, it's not built, but Bush designs it around what else but microfilm. At the time, that was a wildly futuristic technology. Bush had actually worked with microfilm prior to World War II in selection machines, the idea being he could rapidly select specific slides from microfilm. So he knew it was just on the edge of possibility to have automatic linking between slides of microfilm. The second feature is freedom.
Starting point is 00:10:35 And this is a bit free as in free beer, but free as in freedom is the better adage here. This is based off microfilm. That means that your data set is. This is based off microfilm. That means that your data set is anything you can store on microfilm. There's no concept of data encoding because each idea that you link together is just a slide. That means you can have scans of a book, you can have photos, you can have handwritten notes, you can even have scans of books where you overwrite handwritten notes on top. Part of the design of the Mimix is this flat platen where you could scan a book or you could just take a stylus and write down some notes and then load that into the system using wild new
Starting point is 00:11:15 technology like dry photography. And the third hallmark, and this is the weirdest to think about in the modern context, is that hypertext is highly personal. I don't mean that as in you have the background you want or the nice CSS styling that looks pretty on your MySpace page. I mean this was envisioned as a deeply, deeply personal system. You start with nothing. In the case of Memex, you start with an encyclopedia or maybe a dictionary, but you don't have any hypertext. You don't have any connections. You, as the individual user, have to go in and make those connections.
Starting point is 00:11:54 It's personal, but you have to personalize it. That means that there's this weird learning curve with hypertext systems. It's not something you sit down and you're ready to use. A lot of other systems like NLS, you boot up and you are literally greeted with a blank screen. You're expected to use the tools that hypertext gives you to personalize your data set. The reasoning here being that this isn't some one-size-fits-all system. It's this augmentative system. It's supposed to help with your thought process and help you manage information. So in that way, it's much more personal than something like the modern
Starting point is 00:12:29 day internet. Now, that's just a quick rundown of what I think are the core features that make early hypertext really unique compared to modern hypertext, but also for the time revolutionary. really unique compared to modern hypertext, but also, for the time, revolutionary. But whenever I see nice little lists, nice little descriptions of something that's cool, I always think, is this unique? Was this the first thing to put together free-form data, high personalization,
Starting point is 00:12:58 and really adopt new forms of technology to those ends? And the answer is no. The MIMEX and later hypertext systems aren't unique in this regard. There's actually a very rich tradition of super similar systems in what I think is the most exciting field in the world. It's in accounting and insurance. Where else do you find high technology, am I right? All jokes aside, though, banking, accounting, insurance, and some isolated government tasks like census bureaus were the first places to really feel the information problem in the modern era.
Starting point is 00:13:41 And if you step back, that makes sense. These are the industries where you actually do have a wild amount of data that on a day-to-day basis, you have to be able to manage somehow. What's really cool here is there is a rich history of attempts at managing information in better ways in these fields. This starts out in around 1890 to 1919. So there's this 20-year kind of fuzzy time period where people in accounting and insurance, mainly, are designing these automated systems for file management
Starting point is 00:14:17 and for data separation and even for data selection. I want to go through a few of those because this is where we start to see the earliest roots of what might be a glimmer of hypertext. The first is the rejection index. I have that in big scare quotes because this is an anachronism I've made up. There's no trade journals
Starting point is 00:14:37 where people in the 1890s are talking about, oh, in my accounting practice, I've implemented this new rejection index after the recommendation of my associates. are talking about, oh, in my accounting practice, I've implemented this new rejection index after the recommendation of my associates. We know about these primarily through a few letters and a handful of patents. So I've just kind of applied this rejection index name as a way to group them all together after the fact. These are very simple systems where you'd have a normal filing cabinet where you would have compartments that had a
Starting point is 00:15:05 series of these notches on the bottom, little bumps. In this specific patent, they have a series of cabinets where each cabinet has different compartments. And as you can see, they have these little bumps in the bottom, and each compartment's unique. You'd pair those with note cards that had matching notches on the bottom. How this would work in practice is if you tried to put a card with notches in a slot that didn't match up with it, the filing cabinet would reject that card. You'd be physically unable to misfile a card. In practice, this would have been used for things like keeping track of bills or keeping track of invoices.
Starting point is 00:15:51 There's these great examples in patent filings where people have these 12-drawer filing cabinets, one for each month and a series of notching unique to each month. So if you are going in to file a February card in a January slot, which no one would ever do that, but let's say hypothetically it's a bad day. You couldn't do that. Your filing cabinet would prevent you. So in that way, one of the big problems with the information problem is mismanaging data. Humans make errors all the time,
Starting point is 00:16:14 and this is a way of automating away some of that human error. The other aspect to this are patents for full selection. This in particular is something that is really exciting to me. There are a lot of different ways people did this. The earliest ones and some of the most, well, it wouldn't work in real life, but it's a good try, are these tabbed cards. So this patent is from 1896, so pretty early days. How these work is a card would have a series of tabs on the top
Starting point is 00:16:50 where the tabs would have holes in the middle. Now, there's no information on, would you get a card and then cut off the tabs you don't want? Would you apply tabs above it? How would you deal with registry? So there's some feasibility issues. But what's neat about this is it allowed you to store unorganized cards in a drawer. Then when you want to make a selection, instead of having to go to one of 12 drawers, you'd open your drawer,
Starting point is 00:17:17 you take a selection needle, a glorified dowel, and you'd find the tab you want and pull up. This, once again, invoices are on the mind, so a lot of these patents show these being used to group invoices by month. So in that example, you could keep your invoices unorganized, and then if you wanted to check January's invoice numbers, you just pull open your drawer, slide your selector through the January notch and pull up, and you have all your cards.
Starting point is 00:17:49 You don't have to sort. You don't have to file. The actual operations you're doing are automated away. There's also just some really weird patents where people are trying to kind of style on the technology and put it together. This Hargrave patent is one of the most atrocious of the lot. It has a combination of rejection indexes on the top, selection on two of the sides,
Starting point is 00:18:12 an inverse index on one side, so you could use custom filing cabinets with all kinds of sliders and drawers. But eventually, they converge in the 1920s. That's when we start seeing something that's more familiar, at least something that doesn't end in a dead end. This card from Perkins is the perfect example. It's about the same dimensions as a punch card. It also has a registry cut on one corner, much like a punch card. That would have been used to make sure that when you have a stack of cards, they're all facing the same way. Note that in the middle, though, there's nothing. It's just a blank piece of paper. It's just a little note card. You can put anything you want on that. Around the edge is a uniform series of perforations. And you'll also note that some of them have been cut into notches. That's how information is encoded on these cards, which we'll get to in a minute. The other thing I want to point out before we move on
Starting point is 00:19:09 is certain segments of those holes in this patent diagram have been broken out into fields. That's going to become important in just a minute here. Now, throughout the 1920s, companies start licensing some of these patents and creating what are known as edge-notched cards. That name kind of gets applied colloquially. These are barely related to punch cards. They come out of this totally separate filing tradition and not from this number-crunching tradition.
Starting point is 00:19:41 They also only encode metadata, and it's stored on the edge. So there's no data encoding on the face of the card. You're keeping information about your information by encoding these notches and intact perforations on the edge of the card. This is, once again, substantially different from punch cards because there's no data encoding. You're just keeping track of little, just a few little bits of information. These are also fully manually operated. There are no electronic machines to help you manage your notch cards. There are fancy machines for making these, but they're essentially these kind of insanity drill presses. I've seen some photos in collections. They're these drill presses that have
Starting point is 00:20:26 geared arrangements so they can bring down multiple drill bits at once to bore out the holes in a stack of punch cards. It's wild technology, but outside of that, if you're just using these day-to-day, there's nothing electronic about it. With just this setup, just these simple pieces of cardstock, you can sort, index, reference, and select note cards. And something else that is kind of interesting and also will become a problem is there's no standards around these. The only standard is roughly the shape and the dot pitch,
Starting point is 00:20:59 but beyond that, you can encode your metadata however you want. You can put anything you want on the face of the card. I've seen some cards in collections where people have even taped Polaroids onto their Notch cards. So it makes for this really interesting medium that has a wild amount of expressive capability. Now, I just want to drive home a couple of these points.
Starting point is 00:21:22 The first is, these aren't punch cards. The reason I want to drive this home is when I first started researching edge notched cards, I assumed, oh, it's card stock. There's some data on it. It's about the same time that Holorith starts making punch cards, early notched cards appear. So they're probably related, right?
Starting point is 00:21:40 After a large amount of patent surveying and hitting the sources, they're not. And part of that is just they're ideologically totally different tools. Punch cards encode data. Edge-notch cards don't. They only encode metadata. Punch cards store data on their face as encoded information, which isn't even a concept in edge-notch cards.
Starting point is 00:22:02 The big thing is that punch cards are used to do data reduction tasks. These solve the information problem, but they do that by taking a large amount of information and turning it into something digestible to a smaller amount of information. When you do a tabulation, you're taking hundreds of thousands of cards to get an answer out of them. You're not plucking out just one card to check. They also require machines. If you don't have a tabulator, you can't do a tabulation. You can't just grab a stack of punch
Starting point is 00:22:31 cards and go, oh yes, I can scan through these really quickly and the answer you're looking for is five. That doesn't work out very easily by hand. Because of that, punch cards rely on standards. You have to have a card that's a very specific shape, has very specific punching, or you can't use all the automated machinery. So ideologically, punch cards and edge notch cards are in completely different worlds. The other point I want to really drive home
Starting point is 00:23:00 is when I said these are manual, I really mean it. The automation here is handled by two tools. The top tool is this beautiful little hand punch. It's similar to what you might have seen on a train back in the day. But you don't actually need to use that. This would have been used to turn intact perforations into notches. But you can also do that with scissors. In the historical record, there's a lot of accounts of people just using scissors. I actually don't. I'd be curious to see sales figures on hand punches.
Starting point is 00:23:30 The other tool you need is a selection needle. If you went to an official supplier like McBee or Copeland-Chatterson, you can get these beautiful Bakelite-handled needles. They're made out of spring steel. There's a little set screw so you can change the length. They even have a logo embossing gold on the handle. Or you can use a knitting needle. You could use a shish kebab skewer. I've actually used toothpicks for some small decks. So really, you don't need to invest in hardware. The only thing you need to invest in are the cards. And once you have those cards, you have everything you need to get going.
Starting point is 00:24:06 Data here is encoded in something very similar to binary. As far as like ones and zeros, trues and falses, it is binary, but some of the encoding details are wacky. The simplest encoding schema is a dichotomous key, where you just have, you choose a position and you set a hole to be a false and a notch to be true. So if you're doing invoicing in this, which would be the most common application by and large from everything I found, you might have a position that represents paid versus non-paid invoices. As invoices are paid, you pull it out of the stack, you take your little hand punch or a pair of scissors, and you turn the invoice position from a hole to a notch.
Starting point is 00:24:50 Then come up into the month, you need to see which invoices haven't been paid, so you can put those into a pile of delinquent invoices. You'd take your pack of invoice cards, pull out your knitting needle, or maybe clean off the skewer you were just having some lunch off of, and you go to the paid position, you slide it through and pull up. Any card that has an intact hole in that position stays on that needle and is removed from the pack. If it has a notch, it falls off the needle and stays in the lower pack. By doing this, you can make a selection from an unordered set of cards. You don't have to file these cards. They can be in any order you want when you do a selection operation.
Starting point is 00:25:29 There's also a really simple algorithm. It's actually O of 1 as far as complexity goes if you use big O notation. So for any amount of cards, if you're doing a selection on a dichotomous key, that's one operation. It will get more complicated, and there are limitations as far as how big your skewer can be and how many cards you can hold in your hand. But in general, this is a wildly simple selection method.
Starting point is 00:25:58 So at this point, we can start to pull together some ideas here. I'm hoping, if I've done my job, it should be clear that there are some features here that are similar to early hypertext. This is pretty cutting-edge technology for the time. At this point, there wasn't really another way. We're talking 1920s, 1930s. There wasn't really another way to automatically index information like this. It's also very freeform because your data is just a piece of paper. You can put anything you want there. You can paste on news clippings. You can
Starting point is 00:26:31 do little drawings or even circuit diagrams. And they're also known for standards. So it has that nice dual factor of personalization. It's highly personalizable, and you also have to personalize it to use it. So is this actually hypertext? It has all these features, but where's the evidence that edge-notch cards are hypertext? That gets to the issue with this specific research, or rather, issues. There are many. Here's the two worst ones.
Starting point is 00:27:03 There are many smaller ones I can tell you about later if you catch me around the fair. The first is edge-notch cards kind of just aren't preserved very well. Now, this is speculation, but I believe a lot of them are thrown out. As I keep saying, these are used in banking and accounting and insurance, all these back office applications. That's what they're primarily marketed towards. It's a way to modernize your office. Are you going to keep around your tax records for 60 years and donate it to your old alma mater's archives? Probably not. So because of that, once again, I can't prove a negative, but I think a lot of these were just thrown out. Also, when you factor in things like medical records or grant records that were stored on here that might have implications for personal information, there may have been actual policies
Starting point is 00:27:50 in place to destroy records after a certain number of years. And these are just back office records. Edge Notch cards are also really easy to get rid of. Since they're super cheap, you can use them and get rid of them and forget about them. I've read some fascinating papers. There's actually one in the Smithsonian archive that are about naturalists using edge-notched cards for really quick and dirty data reduction. There's this one where a naturalist took notes on edge-notched cards, punched them up when
Starting point is 00:28:20 he got back to his office, and then used them to figure out how many frogs versus toads he saw in the field. He didn't keep those notes, but we have a record that he used edge and notch cards when he was doing his research. So because of this ad hoc nature, and once again, the buy-in is so low, it's a pair of scissors and a knitting needle, it's easy to get rid of them. The other thing is that archivists kind of don't know what to do with these, and this isn't a is that archivists kind of don't know what to do with these, and this isn't a knock against archivists. People who work in archives are the most important people in human
Starting point is 00:28:50 history. I love every one of them. If they're listening, you're the best. I'll buy you a beer, I'll do anything you want, but they don't really know what to do with edge-notch cards. They're not equipped for it. It's not a medium that makes a lot of sense in a traditional archive. It's not a book. It's not loose-leaf paper. It's not a punch card. It's just kind of this weird little piece of cardstock that has perforations on the edge that gets hooked on other things. It's easy to miss file. So I've talked to a lot of archivists, and they've been like, oh yeah, that might be in the collection, but we don't index by that. We don't have any information about if we have these or not. So this makes it really hard to just track these down.
Starting point is 00:29:30 Which leads to a bit of a compounding issue. So as I said, these are cheap. There's almost no upfront cost and they're super flexible. You can mix and match this handful of tools to do anything you want. It's a highly personalized system, which means it's a very ephemeral system. If someone's using this for all their personal notes and then they die, well, it's very personal. You're not really going to be able to figure out what they were doing with it
Starting point is 00:29:59 unless you have a whole lot of information up front. And that's problem two. These are very personal. There's no standard. Did I say that? There's no standard at all. That's been the bane of my existence in this research. These are just a few of the random encoding schemes
Starting point is 00:30:15 you could get printed on prepared edge notched cars. We have ways to do alphanumeric entry, ways to do different number encoding using different base sequences. One of those is a dual row of perforations. I've seen cards that go up to four. I've seen cards that have regions of four rows of perforations
Starting point is 00:30:35 and then regions of one. The sky's the limit, which means you can have really rich data sets. You can have really interesting metadata encoding, but it's also, how do you start to document this? To understand a set of edge notched cards, you need a full set of cards for just the context, but you also need the context of how they were used. You need the keys for what different fields mean. So if you hand me one edge notched card, thanks. That's not very useful. You need all this added information to
Starting point is 00:31:07 actually understand how they're being used. Luckily, there actually are some preserved sets that have really, really good provenance. One of them I was able to find with the help of Christina Engelbart, Doug Engelbart's daughter, and the Smithsonian Archive. It's actually just on the other side of the highway, a few miles away. To explain this, we need to go back to 1962. In that year, Doug Engelbart publishes Augmenting Human Intellect.
Starting point is 00:31:36 This is one of the foundational papers in modern computing. It explains totally new concepts in how to handle user interface design and how to structure data. It leads to NLS, the online system, and the mother of all demos. This leads to the computer mouse. It leads to Windows. It leads to cursors. It leads to video conferencing. It leads to all of these wild modern things. And it also leads to hypertext. to all of these wild modern things, and it also leads to hypertext. Now the paper itself is interesting, but my favorite part are the appendices. In there, Engelbart goes through how and why he arrived at his conclusions. And my favorite appendix is appendix three.
Starting point is 00:32:19 I know, anyone else love appendix three out there? Any big Appendix 3 heads? I like this because he talks about his note system. He talks about day-to-day how he wrote Augmenting Human Intellect. And he says specifically that while he was working on the paper, he kept notes on edge-notch cards. His edge-notch card system. Right after this quote in Section 3, he goes on to explain a system of linking and stitching that he developed on edge-notch cards. And we're not talking about computers here. This is before he was working in a mainframe room. He developed a
Starting point is 00:32:58 hypertext system on the pieces of cardstock we've been talking about. And the wild thing is, it's preserved. The full set of notes is actually available in Stanford's special collection. It was a lot of work to find. When you're going to special collections, you're given a finding guide. It's just a PDF that has maybe one sentence, maybe one word about what's in each box and what's in
Starting point is 00:33:26 each little file folder. They never say edge notched cards are here. They'll say something really useful like notes 1954 to 1958 or correspondence comma notes period. So you just kind of have correspondence comma notes period. So you just kind of have to look and from your context guess and hope you're right. That was the case with these cards. They were just in a big box. It was labeled as notes. But I was able to track these down in 2019. And what's crucial here is we have the context of how these were being used. That was written and published in Augmenting Human Intellect. And we also have the keys. So these two cards, well, this is just an example of what the cards look like. But this larger card with the lines on it, that's actually the schema that Engelbart was using in one series of notes. That's all the information we need to
Starting point is 00:34:26 make sense, to make heads and tails of the data set. You'll notice that there's all those single words on lines. Those are dichotomous keys. He has different categories like facts or speculative ideas. You'll also notice that he has a large set of fields there allocated for numbers. You'll also notice that he has a large set of fields there allocated for numbers. Those are especially crucial because in his note collections, each card had a unique identifier. Maybe we can see where this is going. Engelbart implemented hypertext in his edge-notch card notes by using those unique identifiers. Each card has a serial number. A link is just a number that references that serial number. You can follow that link using
Starting point is 00:35:11 a knitting needle. So in practice, he'd say, oh, you know, I have this note which reminds me about this funding statement that I read or this grant that is available that I have in this other correspondence that I took note of in this other card card number 826 or 123, and he'd just write the number down. And then when he's going back through his notes, he goes, oh, I remember there was something about a funding statement. Oh, it's in card 826. Okay, well, pull out the needle and you can pull it out of the pack. The selection for a serial number like this is a little more complicated than just a normal dichotomous key, but you can do it in a few operations.
Starting point is 00:35:51 There's also tricks you can do with how you handle your serial numbers to make those selections easier. The one just interesting thing to note that I won't bore you with the details, but 7421 encoding, this binary style encoding, is actually sortable using just a few operations.
Starting point is 00:36:11 I think it's like three needle operations can sort a pack if you're encoding your numbers using that base. Now, in practice, that's what a link looks like. That is a physical link that was on one of Engelbart's notes. It's just a number in quotation, well, in parentheses. That references the unique ID number that would be encoded on every card in the set. So in this case, he could pull up card 300
Starting point is 00:36:36 with just a handful of operations. The other feature, and this once again gets back to old hypertext being different than new hypertext, is stitching. This is something the web doesn't really do these days, which honestly I wish it did. The idea is a stitch allows you, using the same idea of this linked data, to combine pieces of information into a larger idea. Specifically how it's implemented in Engelbart's notes is he'd take that serial number and then add an order number.
Starting point is 00:37:14 So you could also look at it as like a group and series number. So here, just as an example, card 826 has parts 1, 2, and 3. And all you have to do is expand to have one more numeric field, and then you can do this kind of grouping. In practice, there's what it looks like. And I really love these cards as an example, because you can tell exactly what's going on. This is card number 129, part one and two. You can see that he ran out of space typing up on one card. He has these little overs on a lot of his cards, but you flip it over, there's more, he runs out of space again.
Starting point is 00:37:54 So he has this other card that has the same ID number and has a little sequence number added on where he finishes the idea. The idea with this, and what's so cool, is he's able to pull out a specific idea and then reorganize it, put it back in order and reconstruct the entire chain of thought. It's really interesting to note, too, that he's really making use of the medium because he hits this point where he runs out of space and instead of just going, oh'll abbreviate it i'll remember later it'll be fine he goes no i'll i have the tools to extend the data i can keep my idea going i can keep rolling with this because the system gives him all the tools he needs using this hypertext ideology so to pull this all together edge notch cards not only have similar features to early hypertext,
Starting point is 00:38:48 they also have almost the same implementation details. In the modern web, how do you access a web page? You have a unique identifier. When you reference it, you reference it by that unique identifier. I mean, a few numbers is a lot less unique than a full URL, but it's serving the same purpose. Edge-notched cards also show up at the dawn of digital hypertext. Doug Engelbart is the first person to implement hypertext on a computer. And as he's designing that system, as he's planning how he's going to implement hypertext, he's doing that on edge-notched
Starting point is 00:39:25 cards. That is a direct connection. They are, if you'll excuse the joke, the missing link to the story of early hypertext. But this is only part of the story. Edge-notched cards aren't just this weird hypertext technology that appear and disappear. They continue being used into the 1970s, if you can believe it. I actually have, and it's really weird. So I live in a pretty small town in pretty rural Northern California. Going through court documents a while back,
Starting point is 00:39:59 I ran into this price list. It has a breakdown of things that the cops in town bought in 1977. And one thing they bought was a crime database on edge-notched cards and a training course on how to use them. The cards have since been destroyed, but there's records of these being used at least as late as 1977 and just out in the middle of nowhere. So these, I personally think, it's secret backroom technology. It's something that's used in the middle of nowhere. So these, I personally think, it's secret backroom technology. It's something that's used in the back of an office
Starting point is 00:40:28 that people don't really talk about, but it's used for decades. This opens up a lot of questions. Was Engelbart the only person using these for hypertext systems? Was he the only person writing these automated links? What else were people using these for? We know they're used for hypertext, they were used for ad hoc data reduction and survey work,
Starting point is 00:40:51 and they were used for office records. But what else is going on? This is just an entire hidden field that we don't have eyes on yet. Currently, there are two fronts to this research. The first is adding depth. Now, this isn't me, but I'm very excited about it. I have a contact in New Zealand whose father recently passed. When he was going through his estate, he found a few filing cabinets full of edge-notched cards. His father was a naturalist.
Starting point is 00:41:19 He was using them to keep his field notes. My contact is currently digitizing those cards. Actually if what he says is to be believed, they should be digitized right about now. What's really exciting about this, besides having a new data point, is he's writing software to do full image analysis to extract the indexes and preserve photos of the cards. And I can't wait to get my hands on that software. That's going to allow me to take the scans I have, all the, well, all the six full sets of edge-notched cards I've been able to find, I have those digitized. Using this software, I'll be able
Starting point is 00:41:58 to pull out those indexes, store that data in a database, and actually operate on them in a computer instead of manually. Because these are old, they're crumbling. I don't really want to pull out a knitting needle and go to town on these. If I damage them, that's something that's lost forever. Also, archives usually don't let you bring pins inside when you're going through the special collection. I don't know how an archive would feel if I showed up with an ice pick and was like, hey, I want to look at box 387 from 1958, please. Can you bring that here?
Starting point is 00:42:31 No, just for a minute. It'll be fine. I'm not going to do anything bad. So I'm really excited to have a way to use this on a computer. I think that'll give us a lot more insight into practically day-to-day how these could be used. And the other side to this is adding breadth. And this is the end of my presentation and my call to action. Have you or a loved one seen these cards? Back in the 70s, did you have a week at work that was ruined dealing with these? Do you know someone that manufactured these, someone that threw these out? If you have, please get in touch with me. I'm going to be here all weekend. I have business cards. You can email me. I'm trying my hardest to find more of these cards because there's so few data points about them. The last academic work, well, academic, the last industry work on edge-notch cards was
Starting point is 00:43:26 published in 1954. That means there's this huge gap where we just, we don't have a centralized understanding of what they were used for. I think in the future, in the coming years, as I collect more information about these, as we start digitizing cards, as we start using them on computers, as we start finding more cards, there we start using them on computers, as we start finding more cards. There's going to be a really fascinating story here. And as we've seen with the connection to hypertext, we're just starting to scratch away at the surface of that. Thank you so much for listening to my talk. If you want to hear more of my ramblings, I host Adren of Computing. It's a podcast. You can get it anywhere anywhere and if you've seen these cards
Starting point is 00:44:05 please come find me I will reward you handsomely I'll at least give you a firm handshake and a thank you applause applause applause applause do you want to take any
Starting point is 00:44:22 questions? I would love to take questions alright so if you have questions, please raise your hand. I'll come to you just so that it's recorded. Hi, Sean. Thanks very much for the talk. It was wonderful. I'm a big fan of the podcast.
Starting point is 00:44:45 Thanks. While doing some research on actual punch cards, this is related but not exactly about edge cards, I found it very difficult to find any sources that would describe how you actually did work using a stack of cards. I mean, I have a stack of cards as sales contracts. Yeah.
Starting point is 00:45:02 Why they didn't turn that into something useful? The best source is this. I actually have little citations to it everywhere since almost all my images came from here. There's this book from the first edition was in 51, the second in 54, I think. It's called Punch Cards, Their Application to Science and Industry. It's a collection of publications from trade journals and academic papers that's on edge-notch cards and punch cards. It's scanned on the Internet Archive for free. I highly recommend that. I have a personal copy that I think I've dog-eared to oblivion.
Starting point is 00:45:36 But that has full explanations of day-to-day how people are using them. Thank you. Hello. I'm fascinated not only about learning the system, but how far it propagated throughout the world. You mentioned that you've heard about collections of these cards in New Zealand. Yeah. I was wondering how did this system get propagated?
Starting point is 00:46:06 Did he present papers, presentations? I guess many of us know about the infamous Mother of All Demos. Did he do presentations about this particular system and academic papers? Oh, you just mentioned some. That's a really good question. So Engelbart didn't present about Notch Cards. They do show up in Augmenting Human Intellect, like I said. So that was part of the propagation. There are also just ads. There were a lot of companies that manufactured edge notch cards. One of the biggest was McBee, which later becomes Royal McBee.
Starting point is 00:46:35 They ran ads everywhere. You can still find a lot of images of, they're really campy. They'll have like photos of someone in a business suit standing astride the world holding a nosh car being like simplify your business solutions conquer the world um so that was a big part of the spread there's also a lot of academic disciplines chemists used these a lot there's actually in punch cards or applications science and industry there's some fascinating papers about how chemists were encoding information about compound structures on the indexes in notch cards they also i think some of the spread was people doing research in different areas of the world there were um i'm gonna mess up the details since I haven't read the paper in a while, but there were some TB studies carried out in India during the British occupation that used
Starting point is 00:47:33 edge notch cards. And they were using, I think, Copland-Chatterson cards, which were manufactured in England. So they're spread via that vector, but in general, they spread by industry goals, kind of. So the cards that were found in New Zealand, have you seen any of those images yet, or are you still waiting for the entire project to be scanned? I've only seen little phone camera images so far. I'm waiting for the full scans. I know that the code's going to be on GitHub,
Starting point is 00:48:14 and he was saying that he's donating the full set to a natural history museum. But I'm still waiting on the full photos. So the reason I ask is similar to the previous question. Were they significantly different from cards you've seen in the U.S.? No. So that's what's really interesting. And this is one of the open questions right now that I need to go on a couple trips to a few archives to finalize this.
Starting point is 00:48:43 I need to go on a couple trips to a few archives to finalize this, but most cards, if they have that registry cut, if they're related to this patent, so I think people were licensing this Perkins patent, and some have these circular cuts at the top. There are some that have a straight cut at the top that I think weren't under license. But in general, around the world, they all look very similar. The form factor ends up being a little more square than
Starting point is 00:49:11 the earlier patents. But in general, they all look roughly the same. That was kind of a rambling answer. I hope that answers the question, though. Hi, thanks for your talk. In terms of early paper-based real hypertext, would you consider the card systems of Paul O'Leary in Belgium in the 1800s to be or to not be hypertext? My man, 100%. I personally think if they're not real hypertext, they're really, really close.
Starting point is 00:49:55 Because he was using, I guess, for those who don't know, he was using a very much more sophisticated indexing system similar to the Dewey Decimal System, but was able to encode much more information about data. And that was used for referencing and connecting data, which is, that's really hypertext-y. So yeah, I consider it hypertext, at least. Thank you.
Starting point is 00:50:27 Edge-notched cards sound awesome. Do you know can you still buy them? I wish. You can't buy new ones. They show up from time to time on eBay. You can get them there by the one and two. I've actually talked to a couple of people that are trying to manufacture some just for personal use and it's not very easy since you do have to have registry
Starting point is 00:50:49 and I've seen machines that are still extant for cutting these out but I don't know if they work so as far as buying new ones no, best you can hope for is new old stock. It looks like most of the usage was predominantly on business and academia. Was there any effort at all for people to use this in a home or an entertainment setting? I've read, so I, you could probably use these for a choose-your-own-adventure novel. I haven't ever seen them used, well, that's not entirely true. There were a couple of sets that were sold as
Starting point is 00:51:34 field guides, so there's like an edge-notch guide to birding or an edge-notch guide to rocks that I found, where it was for personal use. As far as pushing people to use these as a personal filing system, the closest I've seen is there's this really slim book I tracked down that was, I think it's just called an introduction to personal filing, that was trying to advocate for these as for keeping your personal taxes. But I haven't, excuse me, I haven't seen much about people using these for like journaling or in the home
Starting point is 00:52:08 besides small time business stuff. Any other questions? Looks like that's it. Thank you very much. Thank you. Thank you very much. Thank you.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.