I was looking over the impressively improved RDA Part A Chapter 3. I still have some issues (natch!), but it's clear that there has been a lot of thought about data elements and how they differ from simply strings of text.
One of the areas that needs to be thought through is how RDA will be able to change over time, something that is often called "extensibility" of a data standard. I was looking, for example, at the carrier types (184.108.40.206.2), trying to think of what future technology might not be covered. Then I ran into it at the office supply store, where I was replenishing my store of brightly colored sticky notes: you can now purchase tax software on a thumb drive.
(Note: of the carriers listed in RDA A/3 there is something called a "computer chip cartridge". This could be the term that would be used for the thumb drive, but the only examples I can find relate to Nintendo cartridges. So I'm going to pretend, for the purposes of this discussion, that thumb drives aren't covered. Even if they are, something else new will come along, and probably soon.)
RDA has a list of carriers, broken up into large categories like "audio carriers" and "computer carriers." If the item you have in hand isn't on the list, then you are instructed to use "other audio carrier," "other computer carrier," etc. Which means that anything really new will end up in the "other" category, which isn't terribly useful. It also means that something that gets coded as "other" today will have to be updated when the list catches up, but your search on "other computer carrier" will bring up a list of items that may be very different from each other. So there needs to be a way that such lists can be extended quickly, even in a provisional way, to keep up with this fast-changing world. There also needs to be a way that people in the field can find out that the list has been updated.
There are many different ways that you can develop extensibility for a set of data elements. The main thing is that you want the newly minted term to have a clear context (what list does it belong to?), and you want to be able to get people to the definition of the term when they encounter it. In this case, the context is that it is a carrier of information, and it is specifically a new kind of computer carrier. It is also extending an existing list, say, the RDA carrier list.
Let's pretend that we have a registry of terms. And let's pretend that the registry has some management mechanism, such as a small group of participants that oversees the various lists in the registry (so it's not total anarchy). Our thumb drive could be added such that:
returns this information in a machine-readable format:
sublist: computer carrier
element: USB flash drive
date added: 2007-03-30
description: "USB flash drives are NAND-type flash memory data storage devices integrated with a USB (universal serial bus) interface." (quotes because I took that from wikipedia, but generally the expert adding the term would write a suitable description.)
synonyms: thumb drive, jump drive, flash drive
The many products based on RDA would make use of the registry to support the creation of new records and the reading of existing records. With some periodicity, these systems would check that their lists are up to date (like the automatic update of virus lists in your anti-virus software). Such a system could decide that provisional entries would be flagged in some way (maybe they would show up as red on the screen). Or a system receiving a record with a previously unknown item in an authority list could quickly grab the description from the registry and use that to provide services, like definitions and synonyms, to its users.
OK, I'm sure that there are geekier folks out there who could (and hopefully will) point out what parts of this don't work, but I'm mainly interested in exploring the general concept: can we get away from text lists and create something that is dynamic, machine-actionable, and useful?