A Web Page Classification Method Based on TCP/IP Header Features
Di Huang, Xin-Yi Zhang, Qi-Wei Tang
Available Online December 2016.
- https://doi.org/10.2991/icwcsn-16.2017.14How to use a DOI?
- web page classification; packet header feature; instance-based learning
- Web page classification has wide applications. Due to various types of web pages and vast amounts of network traffic, it is difficult to classify web pages by deeply inspecting the content of each packet. This paper presents a learning-based classification method according to TCP/IP header features. First, we propose an approach to select features and improve the Relief algorithm, which can pick features with robustness. Then we raise a labeling strategy to assign each feature with a label when training the classifier. Last, we put forward a learning-based classification method which takes labels and multi-layer semantics into consideration. The experiment results show that the proposed strategy can improve the processing speed and the accuracy of classification.
- Open Access
- This is an open access article distributed under the CC BY-NC license.
Cite this article
TY - CONF AU - Di Huang AU - Xin-Yi Zhang AU - Qi-Wei Tang PY - 2016/12 DA - 2016/12 TI - A Web Page Classification Method Based on TCP/IP Header Features PB - Atlantis Press SP - 61 EP - 64 SN - 2352-538X UR - https://doi.org/10.2991/icwcsn-16.2017.14 DO - https://doi.org/10.2991/icwcsn-16.2017.14 ID - Huang2016/12 ER -