{"body": {"type": "/type/text", "value": "The Open Library project is an enormous undertaking. We can't do it without your help. \r\n\r\nIf you can help with anything do not hesitate to [contact us](/about/contact)!\r\n\r\nHere are some things we can use your help on, categorized by skill:\r\n\r\n## Share some computing power\r\n\r\nIf you have a computer with an Internet connection, run [LibraryThing's Google Book Search Search](http://www.librarything.com/blog/2007/09/google-book-search-on-librarything.php). It'll try to collect data about Google Book Search pages which will allow us to enhance the pages here on Open Library.\r\n\r\n**Note:** LibraryThing's Google Book Search Search is [currently disabled](http://www.librarything.com/blog/2007/09/google-book-search-search-again.php). But if you have a server with extra bandwidth that you'd like to contribute to Open Library-related projects, please [let us know](/about/contact).\r\n\r\n## Making phone calls\r\n\r\nA site like this requires massive amounts of data; we'll take whatever we can get our hands on. Most of the data in the book world is locked up behind contracts or access restrictions. By contrast, we put everything we can on the Internet for free. But there's more data than we can collect by ourselves. That's where you come in: you can help us by making the phone calls with various data providers to try and get us their data.\r\n\r\n### Libraries\r\n\r\nLots of libraries have large catalog collections stored as MARC data. If you can get your favorite friendly library to send us a dump of their data, we'll be forever indebted. There are a few [requirements](http://openlibrary.org/about/requirements) for data submitted to the Open Library. Status:\r\n\r\n* [Library of Congress:](http://openlibrary.org/about/loc) [imported](http://www.archive.org/details/marc_records_scriblio_net) (8M records)\r\n* [University of North Carolina](/about/unc): [uploaded](http://www.archive.org/details/unc_catalog_marc/) (4162599 good records, 12 bad)\r\n* [Oregon State University](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [Washington State University](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [Lewis and Clark](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [Oregon Health Science University](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [National College of Natural Medicine](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [Western States Chiropractic Community Library](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* [Portland Community College](/about/summit): [imported](http://www.archive.org/details/marc_oregon_summit_records/)\r\n* University of Toronto: [imported](http://www.archive.org/details/marc_university_of_toronto) (6M records)\r\n* Miami University of Ohio: [imported](http://www.archive.org/details/marc_miami_univ_ohio) (2028337 records, parses okay)\r\n* [Western Washington University](/about/wwu): [imported](http://www.archive.org/details/marc_western_washington_univ)\r\n* [Boston College](/about/bc): [imported](http://www.archive.org/details/marc_boston_college) (2.1M records)\r\n* Montana State: received\r\n* Drexel: received\r\n* Laurentian: [imported](http://archive.org/details/marc_laurentian)\r\n* [Boston Public Library](http://openlibrary.org/about/BostonPL): [uploaded](http://www.archive.org/details/bpl_marc) (`wiki-beta:/2/pharos/bpl`)\r\n* Wayne State: unknown\r\n* [Talis](/about/talis): [imported!](http://www.archive.org/details/talis_openlibrary_contribution)\r\n* University of California: pursuit in progress (12M records)\r\n* British Library: to pursue\r\n* Bibliotheque Nationale de France: to pursue\r\n* Library and Archives Canada: to pursue\r\n* national libraries of other countries: to pursue\r\n* Libon online books (Italy) [translate from UNIMARC](libon)\r\n* Buffalo State College: [imported](http://www.archive.org/details/wfm_bk_marc/)\r\n* Collingswood Public Library: [imported] (http://archive.org/details/CollingswoodLibraryMarcDump10-27-2008/)\r\n* Harvard's HOLLIS catalog: [imported] (http://archive.org/details/hollis_marc/)\r\n* Ithaca College Library: [imported] (http://www.archive.org/details/marc_ithaca_college/)\r\n\r\nHaving trouble convincing your local library? See if the [Open Library Information Sheet](/about/ol_info_sheet) will answer their questions. \r\n[Worried about OCLC licensing? Read more about their agreement](http://demo.openlibrary.org/about/help/oclc).\r\n\r\n### Publishers\r\n\r\nMost publishers provide \"ONIX feeds\" -- XML export of all their information about the books they publish. Unfortunately, you generally need to call them and make a deal to get these feeds. If you can pick a favorite publisher and try to work out a deal for us, you'll be doing a service for the cause of freedom. Lots of presses are owned by larger companies, so you might want to follow the ownership ladder to make sure you end up with as many records as possible. Status:\r\n\r\n\r\n* HarperCollins: [imported](http://archive.org/details/catalog_harpercollins)\r\n* Simon and Schuster: [imported](http://archive.org/details/catalog_simonschuster)\r\n* Random House: [imported](http://www.archive.org/details/catalog_randomhouse)\r\n* Cambridge University Press: [imported](http://archive.org/details/catalog_cambridge)\r\n* Penguin: downloaded\r\n* Wiley: [downloaded](http://www.archive.org/details/catalog_wiley/)\r\n* Thomas Nelson: downloaded\r\n* O'Reilly: downloaded\r\n* Elseiver: received\r\n* Penn State University Press: received\r\n* [other publishers...](publishers)\r\n* [sample letter to publisher](/about/sample_publisher_letter)\r\n\r\n### Book data sources\r\n\r\nWe have a list of all the ISBNs we know about ([text](/static/isbns/isbns.txt.gz), [html](/static/isbns/isbns.html.gz)) -- if you can collect information about these ISBNs from other data sources, we'd love it.\r\n\r\n* Powells: [downloaded](http://www.archive.org/details/crawl_powells/)\r\n* Amazon: [crawl donated](http://www.archive.org/details/amazon_crawl/)\r\n\r\n\r\n### Book scans\r\n\r\nObviously, we'd also like scans of books:\r\n\r\n* Google: in October 2008, Open Library integrated 400-500K books into our system, retaining the original watermark. These books were not scanned by the Internet Archive, and we are not responsible for their quality.\r\n* OCA: downloaded\r\n* Million Books Project: _negotiations in progress_\r\n* BNF: downloaded\r\n\r\n### Copyright status\r\n\r\nIt's important to know whether a book has fallen into the public domain or not. First this requires data about the book's copyright registration and renewal:\r\n\r\n* **Canada:** We're working with Creative Commons which has an agreement with Access Copyright Canada providing us with data on 300K works from 1742 to the present.\r\n* **US:** \r\n * We've received a copy of [the Stanford copyright renewal database](http://collections.stanford.edu/copyrightrenewals/bin/page?forward=home) which has 250K records on US copyright renewal from 1950 to the 1995.\r\n * public.resource.org [has been crawling the copyright.gov database](http://bulk.resource.org/copyright/), this gets us copyright records since 1978.\r\n * Our own Sebastian Hammer has been parsed Project Gutenberg scans from 1950 to 1977, resulting in 150K records. (Aaron Swartz has the details.)\r\n * Our own Sebastian Hammer has crawled the copyright.gov database from 1978 to 1995, resulting in 600K records.\r\n\r\nIt also requires the development of algorithms for analyzing this information and calculating a current copyright status.\r\n\r\n* **Canada:** David Strauss has been working on an algorithm for us.\r\n* **US:** We have some legal advice from Creative Commons on what the algorithm is. There's also the famed [Hirtle rules](http://www.copyright.cornell.edu/training/Hirtle_Public_Domain.htm) ([more...](http://academiccopyright.typepad.com/academiccopyright/2004/09/index.html))\r\n\r\n### Swap sites\r\n\r\n* BookMooch: [inventory](http://api.bookmooch.com/api/all_inventory.txt), [wish list](http://api.bookmooch.com/api/all_wishlists.txt)\r\n* PaperBackSwap: [inventory](http://www.paperbackswap.com/pub/reports/all_inventory.txt), [wish list](http://www.paperbackswap.com/pub/reports/all_wishlists.txt)\r\n* More coming...\r\n\r\n### Popularity data\r\n\r\nIn a sea of books, it's nice to have some ways of seeing which ones are more \"important\" than others. You want your search engine to bring up _The Da Vinci Code_ before _The Da Vinci Method_, if only because the numbers say that's more likely what people want. And when you're importing a book that says its by David Eggers, you want to guess it's the author of _A Heartbreaking Work of Staggering Genius_ and not _Physical Chemistry: A Textbook_. Similarly, good popularity data can keep you from recommending people things like _Harry Potter_.\r\n\r\n* Amazon.com salesrank: proprietary\r\n* Bookscan sales numbers: proprietary\r\n* Library circulation data: to pursue\r\n* LibraryThing: [received](http://www.librarything.com/feeds/)\r\n* Web mentions: to pursue\r\n* Store/library availability: to pursue (see price checker below)\r\n* our page view data: once we get more popular\r\n\r\n### Respect data\r\n\r\nStrict popularity isn't the only thing that matters. We also want to know whether the book is respected, even if it's not a bestseller.\r\n\r\nFor example, if you happen to have a copy of the Book Review Digest CD-ROM, we'd be massively indebted. The following libraries [supposedly have a copy](http://worldcat.org/oclc/30590649):\r\n\r\n* Los Angeles Public Library (we called them and they can't find their copy; it might be worth bothering them more)\r\n* Mohave County Library District (Kingman, AZ)\r\n* Southern Illinois University (Carbondale, IL)\r\n* World Book (Chicago, IL)\r\n* Indianapolis Public Library\r\n* Library of Congress\r\n* Pentagon Library\r\n* Saint Augustine's College (Raleigh, NC)\r\n* Nova Southeastern University (Ft. Lauderdale, FL)\r\n* US Military Academy (West Point, NY)\r\n* Universit\u00e9 de Montr\u00e9al (Montreal, QC)\r\n* Univ. of Nottingham (Nottingham, UK)\r\n* University of Hong Kong (Pokfulam, HK)\r\n* North West Univ. Potchefstroom Campus (NW, South Africa)\r\n* [Indonesian national library](http://www.pnri.go.id/official_v2005.5/catalogs/collections/data_cd/index.asp?box=detail&id=200504291025&from_box=list&page=4)\r\n* Williams College (Western Massachusetts) [Stetson library](http://williams.edu/Religion/references.html)\r\n* [National University of Singapore Library](http://bweb.nus.edu.sg/ecoll/db/dbinfo.asp?ID=99)\r\n\r\nOther collections of reviews include:\r\n\r\n* Book Review Index\r\n* major journals (LJ, PW, Booklist)\r\n* intra-book citations\r\n* awards\r\n\r\nIs there an online review aggregation site for books? Metacritic only does a handful.\r\n\r\n### Inter-book relations\r\n\r\n* ThingISBN: [available as full dumps](http://www.librarything.com/thingology/2007/03/thingisbn-data-in-one-file.php)\r\n* xISBN: costs money, uses only published algorithms\r\n* FRBRizing algorithms: [first draft of a merge algorithm](http://www.kcoyle.net/temp/merge.html)\r\n\r\n### Copyright information\r\n\r\n* Registration and renewal records\r\n* Orphaned works data (from users and other archivists)\r\n* Our own registration service\r\n\r\n### Other\r\n\r\nIf you know other people with library data, we'd love that too.\r\n\r\n## Library science\r\n\r\n[See our page on librarianship](/about/lib) for things you can help with.\r\n\r\n## Design\r\n\r\nWe want our site to look as good as possible, which means we're always interested in new designs. Luckily, Infogami has a powerful templating system that allows you to create your own look and feel for the site. For more information, read [the guide to our wiki language](http://demo.openlibrary.org/dev/docs/wikilanguage). [Let us know](/about/contact) if you have questions or have developed a nifty new template.\r\n\r\n## Programming\r\n\r\nLots of our work here requires programming. If you have a lot of time, you can hop right in and become a serious developer on Infogami or Open Library. (Check out [our bug list](https://bugs.launchpad.net/openlibrary/+bugs?field.searchtext=&orderby=-importance&search=Search&field.status%3Alist=New&field.status%3Alist=Incomplete&field.status%3Alist=Confirmed&field.status%3Alist=In+Progress&assignee_option=any&field.assignee=&field.bug_reporter=&field.bug_contact=&field.omit_dupes.used=&field.omit_dupes=on&field.has_patch.used=&field.tag=&field.has_cve.used=).) But if you have less time, perhaps you can pick up a smaller project:\r\n\r\n### Price check\r\n\r\nWe want Open Library to be a hub for all the book information on the Internet. As part of that, we're developing plugins to grab data from other sites. For example, we'd like code that checks prices at various stores and lists them, code that sees which libraries a book is available at near a zip code, code which checks to see if a book is available in a Borders near a bookstore, etc. We'd also like code that sees whether a book is available at particular libraries or on particular book trading sites (like bookmooch).\r\n\r\n### Export\r\n\r\nWe'd love to have our data exported in RDF/XML, database dumps, OAI, microformats, Z39.50, a cover repository API."}, "title": "How You Can Help", "last_modified": {"type": "/type/datetime", "value": "2009-03-04T04:57:58.343581"}, "key": "/about/help", "type": {"key": "/type/i18n_page"}, "id": 9888066, "revision": 24}