Have a personal or library account? Click to login
A Variant Character Dataset for Historical Narratives of Middle and Late Imperial China Cover

A Variant Character Dataset for Historical Narratives of Middle and Late Imperial China

By: Jiwon Lee and  Youngim Jung  
Open Access
|May 2025

References

  1. 1Anderl, C. (2020). Some reflections on the Database of Medieval Chinese Texts as a multi-purpose tool for research, teaching, and international collaboration. In B. Basciano, F. Gatti, & A. Morbiato (Eds.), Corpus-based research on Chinese language and linguistics (pp. 341360). Edizioni Ca’ Foscari, Venezia. 10.30687/978-88-6969-406-6/011
  2. 2Farina, A., Marongiu, P., & Rodda, M. A. (2024). Editorial: Representing the Ancient World through Data. Journal of Open Humanities Data, 10(57), 16. 10.5334/johd.245
  3. 3Kessler, F. (2024). Towards context-aware normalization of variant characters in Classical Chinese using parallel editions and BERT. In Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4 AL 2024) (pp. 141151). Association for Computational Linguistics, Stroudsburg, PA. 10.18653/v1/2024.ml4al-1.15
  4. 4Lee, J. (2024). A study on the intertextuality of The Story of Sui and Tang Dynasties using a text reuse detection algorithm. Chinese Literature, 120, 137155. 10.21192/scll.120.202408.007
  5. 5The Ministry of Education of the Republic of China. (1982). Table of standard forms of common national characters. Ministry of Education of the Republic of China. Retrieved January 30, 2025, from https://ws.moe.edu.tw/001/Upload/6/relfile/6490/38921/d190213c-7af8-45bf-b70e-48b4469aad72.pdf
  6. 6The Ministry of Education of the Republic of China. (2017). Table of standard forms of less-common national characters. In The dictionary of Chinese variant characters. Ministry of Education of the Republic of China. Retrieved January 30, 2025, from https://zh.wikisource.org/zh-hant/%E6%AC%A1%E5%B8%B8%E7%94%A8%E5%9C%8B%E5%AD%97%E6%A8%99%E6%BA%96%E5%AD%97%E9%AB%94%E8%A1%A8
  7. 7The Ministry of Education of the Republic of Korea. (2007). Basic Chinese characters for education. Ministry of Education of the Republic of Korea. Retrieved January 30, 2025, from https://www.textbook.or.kr/boardEditStd/filedownload.do?bfId=15&bId=22
  8. 8The State Council of the People’s Republic of China. (2013, August 19). Notification of the State Council on the publication of the “Table of General Standard Chinese Characters”. The State Council of the People’s Republic of China. Retrieved January 29, 2025, from https://www.gov.cn/zwgk/2013-08/19/content_2469793.htm
  9. 9Unicode Consortium. (2024, July 31). Unihan_Variants.txt (Version 16.0.0) [Unihan.zip]. Retrieved January 26, 2025, from https://www.unicode.org/Public/UNIDATA
DOI: https://doi.org/10.5334/johd.325 | Journal eISSN: 2059-481X
Language: English
Submitted on: Mar 12, 2025
Accepted on: Apr 19, 2025
Published on: May 21, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Jiwon Lee, Youngim Jung, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.