Proceedings of the 3rd International Conference on Wireless Communication and Sensor Networks (WCSN 2016)

A Web Page Classification Method Based on TCP/IP Header Features

Authors
Di Huang, Xin-Yi Zhang, Qi-Wei Tang
Corresponding Author
Di Huang
Available Online December 2016.
DOI
https://doi.org/10.2991/icwcsn-16.2017.14How to use a DOI?
Keywords
web page classification; packet header feature; instance-based learning
Abstract
Web page classification has wide applications. Due to various types of web pages and vast amounts of network traffic, it is difficult to classify web pages by deeply inspecting the content of each packet. This paper presents a learning-based classification method according to TCP/IP header features. First, we propose an approach to select features and improve the Relief algorithm, which can pick features with robustness. Then we raise a labeling strategy to assign each feature with a label when training the classifier. Last, we put forward a learning-based classification method which takes labels and multi-layer semantics into consideration. The experiment results show that the proposed strategy can improve the processing speed and the accuracy of classification.
Open Access
This is an open access article distributed under the CC BY-NC license.

Download article (PDF)

Proceedings
Part of series
Advances in Computer Science Research
Publication Date
December 2016
ISBN
978-94-6252-302-9
ISSN
2352-538X
DOI
https://doi.org/10.2991/icwcsn-16.2017.14How to use a DOI?
Open Access
This is an open access article distributed under the CC BY-NC license.

Cite this article

TY  - CONF
AU  - Di Huang
AU  - Xin-Yi Zhang
AU  - Qi-Wei Tang
PY  - 2016/12
DA  - 2016/12
TI  - A Web Page Classification Method Based on TCP/IP Header Features
PB  - Atlantis Press
SP  - 61
EP  - 64
SN  - 2352-538X
UR  - https://doi.org/10.2991/icwcsn-16.2017.14
DO  - https://doi.org/10.2991/icwcsn-16.2017.14
ID  - Huang2016/12
ER  -