Have a personal or library account? Click to login
Analysing the Methods of Dzongkha Word Segmentation Cover

Analysing the Methods of Dzongkha Word Segmentation

Open Access
|Jun 2017

Abstract

In both Chinese and Dzongkha languages, the greatest challenge is to identify the word boundaries because there are no word delimiters as it is in English and other Western languages. Therefore, preprocessing and word segmentation is the first step in Dzongkha language processing, such as translation, spell-checking, and information retrieval. Research on Chinese word segmentation was conducted long time ago. Therefore, it is relatively mature, but the Dzongkha word segmentation has been less studied by researchers. In the paper, we have investigated this major problem in Dzongkha language processing using a probabilistic approach for selecting valid segments with probability being computed on the basis of the corpus.

DOI: https://doi.org/10.1515/acss-2017-0008 | Journal eISSN: 2255-8691 | Journal ISSN: 2255-8683
Language: English
Page range: 61 - 65
Published on: Jun 13, 2017
Published by: Riga Technical University
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2017 Parshu Ram Dhungyel, Jānis Grundspeņķis, published by Riga Technical University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.