
Mini Worldlit: A Dataset of Contemporary Fiction from 13 Countries, Nine Languages, and Five Continents
Andrew Piper, David Bamman, Christina Han, Jens Bjerring-Hansen, Hoyt Long, Itay Marienberg-Milikowsky, Tom McEnaney, Mathias Iroro Orhero, Emrah Peksoy, Pallavi Rastogi, Sebastian Rasmussen, Roel Smeets, Alexandra Stuart, Mads Rosendahl Thomsen

The CONLIT Dataset of Contemporary Literature
Andrew Piper

The TRANSCOMP Dataset of Literary Translations from 120 Languages and a Parallel Collection of English-language Originals
Matt Erlin, Andrew Piper, Douglas Knox, Stephen Pentecost, Allie Blank

MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library
Sil Hamilton, Andrew Piper

HATHI 1M: Introducing a Million Page Historical Prose Dataset in English from the Hathi Trust
Sunyam Bagga, Andrew Piper