Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation
- Kenji Nozaki1, *, Teruhisa Hochin2, Hiroki Nomiya21Graduate School of Information Science, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan2Faculty of Information and Human Sciences, Kyoto Institute of Technology, Matsugasaki, Sakyo-ku, Kyoto 606-8585, Japan*Corresponding author. Email: firstname.lastname@example.org
- Corresponding Author
- Kenji Nozaki
- https://doi.org/10.2991/ijndc.k.190710.001How to use a DOI?
- Instance-based schema matching, schema matching, semantic matching, Word2Vec
Instance-based schema matching is to determine the correspondences between heterogeneous databases by comparing instances. Heterogeneous databases consist of an enormous number of tables containing various attributes, causing the data heterogeneity. In such cases, it is effective to consider semantic information. In this paper, we propose the instance-based schema matching considering attributes’ semantics. We used Word2Vec to match attributes of character strings. The result shows a possibility to detect matching between attributes with high semantic similarity.
- © 2019 The Authors. Published by Atlantis Press SARL.
- Open Access
- This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - JOUR AU - Kenji Nozaki AU - Teruhisa Hochin AU - Hiroki Nomiya PY - 2019 DA - 2019/08 TI - Semantic Schema Matching for String Attribute with Word Vectors and its Evaluation JO - International Journal of Networked and Distributed Computing SP - 100 EP - 106 VL - 7 IS - 3 SN - 2211-7946 UR - https://doi.org/10.2991/ijndc.k.190710.001 DO - https://doi.org/10.2991/ijndc.k.190710.001 ID - Nozaki2019 ER -