As much care as we put into our local discovery tools -- and we certainly encourage you to use them -- we recognize that the best uses of our data may come from those who remix and re-publish these data in novel and interesting ways. We therefore welcome efforts to devise new ways of exploring and accessing our collections and classification systems, including building one’s own discovery tools and services.
Toward this end, we release our metadata as openly and widely as possible. We avoid placing restrictions on reuse, except in cases of ethical, contractual or legal obligations, e.g., metadata received on condition that we not share them further, or metadata for which sharing might compromise user privacy. This service is in line with Yale’s broader support for open access, including the high resolution digital images made available through our cross-collection discovery portal, and open scholarly communication enabled through our EliScholar platform.
Most metadata generated by the Library will be open, by default, for sharing and reuse, and released under a Creative Commons CC0 public domain dedication. Records derived from the shared OCLC WorldCat database are made available under Open Data Commons ODC-BY, with a credit to OCLC. In all other cases, we will attempt to negotiate terms that allow maximal sharing and reuse, as we do with our locally produced records, and label them accordingly.
Modes of Access
The Library intends to make it as easy as possible for both people and machines to download or query our data. Below, we provide brief descriptions and access options for currently available datasets, followed by datasets under current review and datasets of possible future interest.
Datasets currently available
Orbis (Yale catalog) bibliographic, holdings, items, and authority records
Bulk MARC downloads. Files are updated daily and refreshed every Sunday. The output includes bibliographic, holdings, and item data.
Where appropriate, we embed sharing rights directly into the records themselves, e.g.:
"500 \\$aThis WorldCat-derived record is shareable under Open Data Commons ODC-BY, with attribution to OCLC.$5CTY"
"500 \\$aThis Yale-originated record is shareable under Creative Commons license CC0.$5CTY"
Note the following file-naming conventions, where 'type' is either 'aut' (authority) or 'bib' (bibliographic), 'yyyymmdd' is the run date, 'run' is either 'full' or 'incr', and 'nnn' is the file sequence number. Here are the standard file types:
bib_yyyymmdd_run.txt: bibliographic record identifier list
type_yyyymmdd_run: directory containing output MARC record files
type_yyyymmdd_run_nnn.mrc: output MARC record files
mrc_yyyymmdd_run.tsv: output file and record inventory
type_yyyymmdd_del.txt: record identifiers to be deleted when running an update
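For anyone scripting against the bulk downloads, the naming conventions above could be recognized with a few regular expressions. The following Python sketch is illustrative only: the function name `parse_export_name`, the `ExportName` tuple, and the category labels are hypothetical, and it assumes that 'nnn' is one or more digits and that 'yyyymmdd' is a valid calendar date. The dates in the usage example are made up.

```python
import re
from datetime import date, datetime
from typing import NamedTuple, Optional


class ExportName(NamedTuple):
    kind: str                   # hypothetical label for the file category
    record_type: Optional[str]  # 'aut' or 'bib', when the name carries it
    run_date: date
    run: Optional[str]          # 'full' or 'incr', when the name carries it
    sequence: Optional[int]     # file sequence number, for .mrc files only


# One pattern per convention listed above. Assumption: 'nnn' is one or
# more digits; the source does not state a fixed width.
_PATTERNS = [
    ("id_list", re.compile(r"^(?P<type>bib)_(?P<date>\d{8})_(?P<run>full|incr)\.txt$")),
    ("marc_file", re.compile(r"^(?P<type>aut|bib)_(?P<date>\d{8})_(?P<run>full|incr)_(?P<seq>\d+)\.mrc$")),
    ("inventory", re.compile(r"^mrc_(?P<date>\d{8})_(?P<run>full|incr)\.tsv$")),
    ("delete_list", re.compile(r"^(?P<type>aut|bib)_(?P<date>\d{8})_del\.txt$")),
    ("marc_dir", re.compile(r"^(?P<type>aut|bib)_(?P<date>\d{8})_(?P<run>full|incr)$")),
]


def parse_export_name(name: str) -> Optional[ExportName]:
    """Classify a file or directory name; return None if nothing matches."""
    for kind, pattern in _PATTERNS:
        match = pattern.match(name)
        if match is None:
            continue
        groups = match.groupdict()
        try:
            run_date = datetime.strptime(groups["date"], "%Y%m%d").date()
        except ValueError:
            return None  # eight digits that are not a real calendar date
        seq = groups.get("seq")
        return ExportName(
            kind=kind,
            record_type=groups.get("type"),
            run_date=run_date,
            run=groups.get("run"),
            sequence=int(seq) if seq is not None else None,
        )
    return None


if __name__ == "__main__":
    for example in ("bib_20240107_full_001.mrc", "aut_20240108_del.txt"):
        print(example, "->", parse_export_name(example))
```

A script applying an incremental update might use the 'delete_list' category to find identifiers to remove before loading the matching 'marc_file' entries, though the exact update procedure is not specified here.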