Abstract
The YouTube Disinformation Comments Corpus provides 71,025 comments to 23 YouTube videos posted by US news organizations on the false claim that Haitian immigrants in Springfield, Ohio were stealing and eating neighborhood pets, which was picked up and amplified by the 2024 Republican presidential ticket of Donald Trump and J.D. Vance. Additionally, the YouTube Disinformation Comments Corpus features metadata associated with each YouTube video, including video titles, descriptions, transcripts, like counts, and YouTube content tags. This dataset is useful for people studying (1) natural language processing, (2) network textual analysis, (3) the circulation of dis- and misinformation on YouTube; (4) the reception of dis- and misinformation on YouTube; (5) news framing effects on YouTube and (6) public discussions of US immigration. TFIDF analysis is used to provide an overview of the variations across comment data.
