{"body": {"type": "/type/text", "value": "\r\n\r\n
\r\nThe thing called "FRBR-izing" is really the creation of a set of \r\n records that all represent the same work. So rather than having a display that \r\n shows all of the different editions of the work separately:
\r\n... you have a single display for the work, that links to all of the editions. \r\n Different systems display this differently. Here is the OCLC FictionFinder display:
\r\nAnd here is the beginning of the display of all of the many editions:
\r\nThis creates a two-tired database with Works and Editions. (Note: Editions \r\n are called "Manifestations" in library lingo.) The two big questions \r\n are:
\r\nNote that FRBR-ization affects only a small percentage of bibliographic records. \r\n OCLC's \r\n statistics show that 78% of the items in WorldCat are unique Works. Only \r\n 1% of Works have up to 7 Editions, and only 30,000 in their database have more than 20 Editions.
\r\n\r\n\r\nThere is no definitive answer to what is a work, especially when it comes to changes in format, such as a book that has become a screenplay and then is made into a movie. But since we only have books in the OL database at the moment, the task is somewhat simpler: bring together books that are essentially the same text. Basically, the elements that define a work are:
\r\nThis isn't quite as simple as it seems because ideally one would also bring together different translations of the same work, and of course those do not have the same title. In some records that we receive from libraries there will be a special "work title" that contains the original title of the work regardless of the language of the translation.
\r\n\r\n Mann, Thomas\r\n [Zauberberg]\r\n\tMagic Mountain.\r\n\t\r\n Mann, Thomas\r\n [Zauberberg]\r\n\tLa Montagna incantata.\r\n\r\n
There are also Works that are the same but have been printed with different \r\n titles at different times or in different countries, such as the works of Shakespeare \r\n and Harry Potter. The work titles (called "uniform titles" in library \r\n lingo) are unfortunately not used consistently even in library records, and \r\n don't exist at all in records from our other sources. At some point we will \r\n have to rely on users to bring together works that do not get identified algorithmically. \r\n We also have a set of ISBNs from LibraryThing to use, and could probably make \r\n some use of the xISBN service from OCLC. This, however, only helps us with works \r\n that have an ISBN.
\r\nIn terms of an algorithm, OCLC's work \r\n set algorithm is available. However, it makes use of some data elements \r\n that we will not have, in particular those that OCLC derived from LC Authority \r\n records.
\r\nThe Work-set display and the Edition display will make use of different fields. A page on the fields and display is here.\r\n\r\n
It is quite possible that the current edition matching algorithm that we use \r\n can be adapted to determine works in a way that approximates the OCLC results. \r\n This won't be as accurate as the OCLC algorithm, but we can use OCLC's FictionFinder \r\n database as a test set against which we can measure our results.
\r\n\r\n \r\nThere are undoubtedly many different ways that we could design a database to support FRBR. Some possible designs are:
\r\nNote that based on the OCLC statistics, if we create a Work record for each work (even those that have a single edition) we will increase the number of records in the database by about 75%.\tCreating a Work record only when there are multiple editions, however, may add complexities to display.
\r\n\r\n\r\n"}, "title": "FRBRization in the Open Library", "last_modified": {"type": "/type/datetime", "value": "2008-08-17 18:15:28.429732"}, "key": "/about/frbrization", "type": {"key": "/type/page"}, "id": 17867179, "revision": 3}