Qidian-Webnovel Corpus: A Dataset of Chinese Web Novels with Multilingual Reader Response

By: Ze Yu, Federico Pianzola and Emin Tatar
Open Access | Dec 2025

Figures & Tables

Table 1

Metadata of stories in the corpus.

SOURCE | COMMENT ID | COMMENT CONTENT | REPLY ID | REPLY CONTENT | BOOK ID | USER ID | USER LEVEL | RATING SCORE | REPLY AMOUNT | LIKE AMOUNT | CREATE TIME | QUOTE REVIEW ID | QUOTE CONTENT | QUOTE USER ID
Webnovel (EN)
Qidian (CN)
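
The column list in the header above can be read as a record schema. The following is a minimal Python sketch of such a record, not the authors' own data format: the class name `CommentRecord`, the helper functions, the snake_case field names, the choice to keep every value as a raw string, and the sample row are all assumptions made for illustration.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class CommentRecord:
    """Hypothetical record with one field per column header of the table."""
    source: str  # e.g. "Webnovel (EN)" or "Qidian (CN)"
    # All remaining fields are optional raw strings (an assumption):
    # the page does not show which platform provides which field.
    comment_id: Optional[str] = None
    comment_content: Optional[str] = None
    reply_id: Optional[str] = None
    reply_content: Optional[str] = None
    book_id: Optional[str] = None
    user_id: Optional[str] = None
    user_level: Optional[str] = None
    rating_score: Optional[str] = None
    reply_amount: Optional[str] = None
    like_amount: Optional[str] = None
    create_time: Optional[str] = None
    quote_review_id: Optional[str] = None
    quote_content: Optional[str] = None
    quote_user_id: Optional[str] = None


def header_to_field(name: str) -> str:
    """Map a header such as 'COMMENT ID' to a field name such as 'comment_id'."""
    return name.strip().lower().replace(" ", "_")


def record_from_row(row: dict) -> CommentRecord:
    """Build a record from a header-keyed row, skipping empty or unknown cells."""
    known = {f.name for f in fields(CommentRecord)}
    cleaned = {}
    for header, value in row.items():
        key = header_to_field(header)
        if key in known and value not in ("", None):
            cleaned[key] = value
    return CommentRecord(**cleaned)


# Placeholder row with invented values, only to show the mapping.
sample_row = {
    "SOURCE": "Qidian (CN)",
    "COMMENT ID": "c1",
    "COMMENT CONTENT": "placeholder text",
    "BOOK ID": "b1",
    "LIKE AMOUNT": "3",
    "REPLY ID": "",
}
record = record_from_row(sample_row)
```

A row dictionary of this shape is what `csv.DictReader` yields for a file whose first line holds the column headers, so the same helper could be applied per row if the corpus files are distributed as CSV (also an assumption).
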
Table 2

Metadata for comments and replies.

SOURCE | GENRES | CATEGORY/TAG | TOTAL COMMENTS | REPLIES | PRIMARY LANGUAGE | COMMENTS LANGUAGE DISTRIBUTION | REPLIES LANGUAGE DISTRIBUTION
Qidian (CN) | 14 | 27 | 2,791,837 | 855,577 | Chinese | Chinese: 95.7%; English: 0.1% | Chinese: 97.2%; English: 0.05%
Webnovel (EN) | 8 | 40 | 327,988 | 96,250 | English | English: 72.7%; Others: 27.3% | English: 68.2%; Others: 31.8%
Table 3

Reader profile metadata.

SOURCE | USER ID | USER NAME | GENDER | LEVEL INFO | WRITING DAYS | READING HOURS | NUM READ BOOKS | DESCRIPTION | DATE JOINED | LOCATION | NUM FOLLOWERS
Webnovel (EN)
Qidian (CN)
Table 4

Demographic Information.

LOCATION | NUMBER | PERCENTAGE (%)
Global | 55,100 | 42.80
United States | 15,891 | 12.30
Philippines | 14,425 | 11.20
India | 9,591 | 7.40
Indonesia | 3,231 | 2.50
Nigeria | 2,972 | 2.30
Malaysia | 2,277 | 1.70
Canada | 2,046 | 1.50
Australia | 1,602 | 1.20
United Kingdom | 1,584 | 1.20
Brazil | 1,478 | 1.10
DOI: https://doi.org/10.5334/johd.368 | Journal eISSN: 2059-481X
Language: English
Submitted on: Aug 4, 2025
Accepted on: Oct 23, 2025
Published on: Dec 12, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Ze Yu, Federico Pianzola, Emin Tatar, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.