Abstract
The Journal Digital Corpus (JDC) comprises transcriptions of Swedish historical newsreels, drawn primarily from the SF Veckorevy newsreels produced between the early 1910s and the 1960s. JDC includes transcribed speech from 2,553 newsreels (over two million words) and intertitles from 4,333 videos. Created with two custom-built Python libraries, SweScribe and stum, the corpus provides unprecedented access to historical narratives of Swedish modernity. It offers extensive research opportunities across history, cultural studies, linguistics, and media analysis, enabling detailed examination of societal shifts, media representation, and linguistic developments in twentieth-century Sweden.
