The thing called "FRBR-izing" is really the creation of a set of records that all represent the same work. So rather than having a display that shows all of the different editions of the work separately:
... you have a single display for the work, that links to all of the editions. Different systems display this differently. Here is the OCLC FictionFinder display:
And here is the beginning of the display of all of the many editions:
This creates a two-tired database with Works and Editions. (Note: Editions are called "Manifestations" in library lingo.) The two big questions are:
Note that FRBR-ization affects only a small percentage of bibliographic records. OCLC's statistics show that 78% of the items in WorldCat are unique Works. Only 1% of Works have up to 7 Editions, and only 30,000 in their database have more than 20 Editions.
There is no definitive answer to what is a work, especially when it comes to changes in format, such as a book that has become a screenplay and then is made into a movie. But since we only have books in the OL database at the moment, the task is somewhat simpler: bring together books that are essentially the same text. Basically, the elements that define a work are:
This isn't quite as simple as it seems because ideally one would also bring together different translations of the same work, and of course those do not have the same title. In some records that we receive from libraries there will be a special "work title" that contains the original title of the work regardless of the language of the translation.
Mann, Thomas
[Zauberberg]
Magic Mountain.
Mann, Thomas
[Zauberberg]
La Montagna incantata.
There are also Works that are the same but have been printed with different titles at different times or in different countries, such as the works of Shakespeare and Harry Potter. The work titles (called "uniform titles" in library lingo) are unfortunately not used consistently even in library records, and don't exist at all in records from our other sources. At some point we will have to rely on users to bring together works that do not get identified algorithmically. We also have a set of ISBNs from LibraryThing to use, and could probably make some use of the xISBN service from OCLC. This, however, only helps us with works that have an ISBN.
In terms of an algorithm, OCLC's work set algorithm is available. However, it makes use of some data elements that we will not have, in particular those that OCLC derived from LC Authority records.
The Work-set display and the Edition display will make use of different fields. A page on the fields and display is here.
It is quite possible that the current edition matching algorithm that we use can be adapted to determine works in a way that approximates the OCLC results. This won't be as accurate as the OCLC algorithm, but we can use OCLC's FictionFinder database as a test set against which we can measure our results.
There are undoubtedly many different ways that we could design a database to support FRBR. Some possible designs are:
Note that based on the OCLC statistics, if we create a Work record for each work (even those that have a single edition) we will increase the number of records in the database by about 75%. Creating a Work record only when there are multiple editions, however, may add complexities to display.