A Column Styled Composable Schema Matcher for Semantic Data-Types

Open Access | Jun 2019

Figures & Tables

Figure 1

REPSASM System Architecture.

Figure 2

REPSASM Schema Matching Architecture.

Figure 3

Classifying a Column Using the Match Tree.

Table 1

Column mapping between CKAN and CERIF.

A | is_source_of_has_classification_has_term
B | is_destination_of_has_source_is_source_of_has_destination_has_URI
C | is_dstination_of_has_source_is_source_of_has_destination_type
D | is_source_of_has_destination_type
E | is_destination_of_has_source_is_source_of_has_endDate
F | has_identifier_is_source_of_has_endDate
H | has_identifier_has_id_value
I | is_destination_of_has_source_has_identifier_has_URI
J | is_destination_of_has_source_has_identifier_type
K | is_destination_of_type
M | is_destination_of_has_source_is_source_of_has_destination_has_name
N | is_destination_of_has_source_type
O | is_destination_of_has_classification_type
P | has_identifier_has_URI
Q | is_source_of_has_classification_type
R | has_identifier_is_source_of_has_classification_type
S | is_destination_of_has_endDate
T | is_destination_of_has_startDate
V | is_destination_of_has_source_has_identifier_has_id_value
X | is_source_of_has_endDate
Y | has_identifier_is_source_of_has_startDate
a | is_destination_of_has_source_is_source_of_has_classification_type
b | is_destination_of_has_source_is_source_of_type
c | is_destination_of_has_source_is_source_of_has_startDate
d | has_identifier_is_source_of_type
e | is_source_of_has_startDate
has_description | has_description
has_identifier_label | has_identifier_label
has_identifier_type | has_identifier_type
has_name | has_name
is_source_of_type | is_source_of_type
label | label
type | type
unknown | unknown
Figure 4

Number of instances per class in the CERIF learn set.

Figure 5

Number of instances per class in the CKAN test set.

Figure 6

Inlier test on the CKAN-CERIF dataset. Accuracy was averaged over 5 tests with 31 classes. Number of simulated columns per class: 15.

Figure 7

Outlier test on the CKAN-CERIF dataset. Scores were averaged over 5 tests with 31 classes. Number of simulated columns per class: 15.

Figure 8

Finalized Fitted Pipeline for the CKAN-CERIF dataset.

Figure 9

Inlier test on the CKAN-CERIF dataset. Accuracy was averaged over 3 tests.

Figure 10

Outlier test on the CKAN-CERIF dataset. Scores were averaged over 3 tests.

Language: English
Submitted on: Feb 23, 2019 | Accepted on: May 14, 2019 | Published on: Jun 24, 2019
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2019 Xiaofeng Liao, Jordy Bottelier, Zhiming Zhao, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.