Have a personal or library account? Click to login
Graph-Based Complex Representation in Inter-Sentence Relation Recognition in Polish Texts Cover

Graph-Based Complex Representation in Inter-Sentence Relation Recognition in Polish Texts

Open Access
|Mar 2018

Abstract

This paper presents a supervised approach to the recognition of Cross-document Structure Theory (CST) relations in Polish texts. Its core is a graph-based representation constructed for sentences. Graphs are built on the basis of lexicalised syntactic-semantic relations extracted from text. Similarity between sentences is calculated as similarity between their graphs, and the values are used as features to train the classifiers. Several different configurations of graphs, as well as graph similarity methods were analysed for this task. The approach was evaluated on a large open corpus annotated manually with 17 types of selected CST relations. The configuration of experiments was similar to those known from SEMEVAL and we obtained very promising results.

DOI: https://doi.org/10.2478/cait-2018-0013 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702
Language: English
Page range: 152 - 170
Submitted on: Oct 20, 2017
Accepted on: Jan 31, 2018
Published on: Mar 30, 2018
Published by: Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2018 Arkadiusz Janz, Paweł Kędzia, Maciej Piasecki, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.