Have a personal or library account? Click to login
Chinese Language Word Embeddings Based on the Corpus Hanku Cover

Chinese Language Word Embeddings Based on the Corpus Hanku

Open Access
|Aug 2022

Abstract

Vector models based on word embeddings are an indispensable part of advanced Natural Language Processing research and language analysis. We describe several Chinese language (Pǔtōnghuà) word embeddings, the differences from “western” language models caused by specific orthographic and linguistic features of the written Chinese language, and introduce a publicly available web interface for querying the vector models, aimed at linguistically or pedagogically oriented users.

DOI: https://doi.org/10.2478/jazcas-2022-0023 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 996 - 1004
Published on: Aug 17, 2022
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2022 Radovan Garabík, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.