International Journal of Networked and Distributed Computing

Volume 7, Issue 3, August 2019, Pages 100 - 106

Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation

Authors
Kenji Nozaki¹,*, Teruhisa Hochin², Hiroki Nomiya²
¹Graduate School of Information Science, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan
²Faculty of Information and Human Sciences, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan
*Corresponding author. Email: ken23mybgbc2@gmail.com
Corresponding Author
Kenji Nozaki
Received 7 February 2019, Accepted 6 May 2019, Available Online 5 August 2019.
DOI
https://doi.org/10.2991/ijndc.k.190710.001
Keywords
Instance-based schema matching, schema matching, semantic matching, Word2Vec
Abstract

Instance-based schema matching determines the correspondences between heterogeneous databases by comparing their instances. Heterogeneous databases consist of an enormous number of tables containing various attributes, which causes data heterogeneity. In such cases, it is effective to consider semantic information. In this paper, we propose an instance-based schema matching method that considers the semantics of attributes. We use Word2Vec to match attributes whose values are character strings. The results suggest that it is possible to detect matches between attributes with high semantic similarity.
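
As a rough illustration of the idea in the abstract, the following minimal sketch compares two string attributes by the word vectors of their instance values. It is not the authors' method: averaging the instance vectors and scoring by cosine similarity are assumptions made here for illustration, and the names `TOY_VECTORS`, `attribute_vector`, and `attribute_similarity` are hypothetical. The tiny hand-written vectors stand in for embeddings that would normally come from a trained Word2Vec model.

```python
import math

# Hypothetical 3-dimensional "word vectors" (assumption for illustration);
# a real setup would load embeddings from a trained Word2Vec model.
TOY_VECTORS = {
    "tokyo": [0.90, 0.10, 0.00],
    "kyoto": [0.80, 0.20, 0.10],
    "osaka": [0.85, 0.15, 0.05],
    "apple": [0.10, 0.90, 0.20],
    "banana": [0.00, 0.80, 0.30],
}


def attribute_vector(instances, vectors):
    """Average the vectors of the instance values that have an embedding."""
    known = [vectors[v.lower()] for v in instances if v.lower() in vectors]
    if not known:
        return None  # no instance value is covered by the vocabulary
    dim = len(known[0])
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def attribute_similarity(col_a, col_b, vectors=TOY_VECTORS):
    """Score two string attributes by the similarity of their mean vectors."""
    va = attribute_vector(col_a, vectors)
    vb = attribute_vector(col_b, vectors)
    if va is None or vb is None:
        return 0.0
    return cosine(va, vb)


# Instance values of three attributes taken from different (toy) tables.
city_a = ["Tokyo", "Kyoto"]
city_b = ["Osaka", "Kyoto"]
fruit = ["Apple", "Banana"]

print("city_a vs city_b:", attribute_similarity(city_a, city_b))
print("city_a vs fruit: ", attribute_similarity(city_a, fruit))
```

With these toy vectors, the two city attributes score higher than the city and fruit pair, which is the kind of signal a semantic matcher would use to propose a correspondence.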

Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Journal
International Journal of Networked and Distributed Computing
Volume-Issue
7 - 3
Pages
100 - 106
Publication Date
2019/08
ISSN (Online)
2211-7946
ISSN (Print)
2211-7938

Cite this article

TY  - JOUR
AU  - Kenji Nozaki
AU  - Teruhisa Hochin
AU  - Hiroki Nomiya
PY  - 2019
DA  - 2019/08
TI  - Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation
JO  - International Journal of Networked and Distributed Computing
SP  - 100
EP  - 106
VL  - 7
IS  - 3
SN  - 2211-7946
UR  - https://doi.org/10.2991/ijndc.k.190710.001
DO  - https://doi.org/10.2991/ijndc.k.190710.001
ID  - Nozaki2019
ER  -