Abstract
The Seattle Public Library (SPL) publishes anonymized, open-access checkout data for every item in its collection, dating from 2005 to the present. To our knowledge, it is the only U.S. library to release checkout data by title with this level of temporal detail: one dataset records exact timestamps for print book checkouts, while another provides monthly aggregates across all formats (e.g., ebooks, audiobooks, print books). Because U.S. book sales data is largely inaccessible outside the publishing industry, SPL’s open checkout data offers a rare and valuable alternative. But how well does it generalize beyond Seattle? Does it reflect book sales? And what can it tell us about readers more broadly?
In this paper, we introduce SPL’s checkout data and evaluate its potential for humanistic and literary research. We specifically assess how well it: (1) corresponds with book sales, (2) and extrapolates to library checkout patterns elsewhere nationally. First, we compare SPL data against publishing revenue reports and prior research with access to sales figures. We find that SPL patrons embrace digital books more than general consumers, but the overall distribution of checkouts resembles broader book sales patterns. Second, we compare SPL’s most checked out books per year to the New York Public Library’s annual top 10 lists. We find general overlap, but also distinct regional preferences, suggesting geographic extrapolation should be approached with caution. We conclude that SPL’s checkout data provides a rare window into library circulation and reading habits, with granular, time-series insights. However, its generalizability—particularly across regions and in relation to book sales—remains uncertain.
