It looks like you're offline.
Open Library logo
additional options menu
Last edited by Drini
1 day ago | History

Open Library Data Dumps

Open Library provides dumps of all its data, generated every month. Most of the data dumps are formatted as tab separated files with the following columns:

Dumps

Coming soon:

For past dumps, see: https://archive.org/details/ol_exports?sort=-publicdate

Downloading the dumps take too long? Checkout the link above and download via torrent for higher speeds!

Format of JSON records

A JSON schema for the various types is located at https://github.com/internetarchive/openlibrary-client/tree/master/olclient/schemata

Using Open Library Data Dumps

This guide by contributor on the LibrariesHacked GitHub about how to load Open Library's data dumps into PostgreSQL to make it more easily queriable:
https://github.com/LibrariesHacked/openlibrary-search

DuckDB

DuckDB is another easy tool to query the dump without much work.

For example:
If you wanted to get all the Wikidata IDs currently in the authors table:
`<br /> SELECT json_extract(column4, '$.remote_ids.wikidata') as wikidata_id<br /> FROM read_csv('ol_dump_authors_2024-07-31.txt.gz') <br /> WHERE wikidata_id IS NOT NULL <br /> LIMIT 100;<br /> `

GraphQL

DiFronzo on GitHub has produced a GraphQL proxy to search books using work, edition and ISBN with the Open Library API. Deployed with Deno and GraphQL:

https://github.com/DiFronzo/OpenLibrary-GraphQL
DiFronzo/OpenLibrary-GraphQL

OL Covers Dump

We do not yet have rolling monthly dumps of our book covers, despite a shared desire for their existence. Some historical cover dumps may be explored here:
https://archive.org/details/ol_data?tab=collection&query=identifier%3Acovers&sort=-addeddate

Most covers are archived in the following items. Note covers_0006 and covers_0007 are presently unavailable.

History

1 day ago Edited by Drini Edited without comment.
1 day ago Edited by Drini Add coming soon wikidata link
January 10, 2025 Edited by raybb fix typos
January 4, 2025 Edited by raybb add DuckDB note
December 14, 2011 Created by Anand Chitipothu Documented Open Library Data Dumps