Open Library provides dumps of all the data in various formats. Currently these dumps are generated every month.
OL Dump
This contains dump of latest editions of all the records in Open Library. This is a tab separated file with the following columns:
-
type - type of record (/type/edition, /type/work etc.)
-
key - unique key of the record. (/books/OL1M etc.)
-
revision - revision number of the record
-
last_modified - last modified timestamp
- JSON - the complete record in JSON format
This dump can be downloaded from:
http://openlibrary.org/data/ol_dump_latest.txt.gz
For convenience, this dump is split into multiple files based on type.
-
editions dump - http://openlibrary.org/data/ol_dump_editions_latest.txt.gz
-
works dump - http://openlibrary.org/data/ol_dump_works_latest.txt.gz
- authors dump - http://openlibrary.org/data/ol_dump_authors_latest.txt.gz
OL Complete Dump
This contains dump of all revisions of all the records in Open Library. Format is same as the OL dump.
This dump can be downloaded from:
http://openlibrary.org/data/ol_cdump_latest.txt.gz
Format of JSON records
Author Record
:TODO:
Edition
:TODO:
Work
:TODO:
History
- Created December 14, 2011
- 24 revisions
March 22, 2024 | Edited by raybb | Edited without comment. |
October 8, 2023 | Edited by raybb | update dump sizes |
February 3, 2023 | Edited by Tom Morris | Update sizes for dumps of main entities |
November 17, 2021 | Edited by raybb | update file sizes |
December 14, 2011 | Created by Anand Chitipothu | Documented Open Library Data Dumps |