
Figure 1
Number of books by publication date. The preprocessed dataset has 47,685 books in English consisting of ≈5.1 billion tokens. The red vertical dashed lines mark the boundaries between the time periods we used to slice the dataset. See Section 2.2 for details.
