Domain Topic and Hidden Deep Web Data Extracting
Liming Du, Abdulhamid Yahaya, Gui Li, Fengying Wang, Jie Dong
Available Online May 2018.
- https://doi.org/10.2991/amcce-18.2018.151
- Web Data; Data extraction; Data Mining
- This paper studies domain-based methods for extracting data entities from the web. Based on an analysis of real estate industry websites, it proposes a topic-oriented extraction model and gives the corresponding search strategy. For deep web information, it designs a sorting-based classification extraction algorithm for numerical data. Finally, an experimental example verifies the effectiveness of the algorithm.
- Open Access
- This is an open access article distributed under the CC BY-NC license.
Cite this article
TY - CONF
AU - Liming Du
AU - Abdulhamid Yahaya
AU - Gui Li
AU - Fengying Wang
AU - Jie Dong
PY - 2018/05
DA - 2018/05
TI - Domain Topic and Hidden Deep Web Data Extracting
BT - 2018 3rd International Conference on Automation, Mechanical Control and Computational Engineering (AMCCE 2018)
PB - Atlantis Press
SP - 862
EP - 866
SN - 2352-5401
UR - https://doi.org/10.2991/amcce-18.2018.151
DO - https://doi.org/10.2991/amcce-18.2018.151
ID - Du2018/05
ER -